SPARSE EQUIDISTRIBUTION PROBLEMS, PERIOD BOUNDS, 

AND SUBCONVEXITY 



AKSHAY VENKATESH 



Abstract. We introduce a "geometric" method to bound periods of automor- 
phic forms. The key features of this method are the use of equidistribution 
results in place of mean value theorems, and the systematic use of mixing and 
the spectral gap. Applications are given to equidistribution of sparse subsets of 
horocycles and to equidistribution of CM points; to subconvexity of the triple 
product period in the level aspect over number fields, which implies subcon- 
vexity for certain standard and Rankin-Selberg L-functions; and to bounding 
Fourier coefficients of automorphic forms. 
1. Introduction.

1.1. General introduction. Let $\Gamma \subset G$ be a lattice in an S-arithmetic
group. Let $Y \subset \Gamma\backslash G$ be a subset endowed with a probability
measure $\nu$, and $f$ a function on $\Gamma\backslash G$. Fixing a basis
$\{\psi_j\}$ for $L^2(Y,\nu)$, we shall refer to the numbers
$\int_Y f\psi_j\,d\nu$ as the periods of $f$ along $Y$. Evidently, the periods
depend heavily on the choice of basis for $L^2(Y,\nu)$. They play a major role in
the theory of automorphic forms, in significant part because they often express
information about L-functions.

The present paper is centered around a geometric method yielding upper bounds
for these periods. It is applicable, roughly speaking, when considering the periods
of a fixed function $f$ along a sequence of subsets $(Y_i, \nu_i)$, with the property
that the $Y_i$ are becoming equidistributed; that is to say, the $\nu_i$ approach
weakly the $G$-invariant measure on $\Gamma\backslash G$. The key inputs of this
method are, firstly, the equidistribution of the $\nu_i$, and secondly, the mixing
properties of certain auxiliary flows. More precisely, we shall need these
properties in a quantitative form; in the cases we consider, this will follow
eventually from an appropriate spectral gap.

This situation might seem rather restrictive. However, it arises in many
natural equidistribution questions ("sparse equidistribution problems," as we
discuss below) as well as in the analytic theory of automorphic forms (especially,
subconvexity results for L-functions). There are applications besides those
discussed in the present paper; our aim has not been to give an exhaustive
discussion, but rather just to present a representative sample of interesting
cases. We shall explain the method abstractly in Sec. 1.3 and will carry out, in
the body of the paper, one example of each of the following cases: $Y_i$ is the
orbit of a unipotent, a semisimple, and a toral subgroup of $G$.

In the present paper, we have focused mostly on the case of $\mathrm{PGL}_2$ and
$\mathrm{GL}_2$ over number fields. All our results pertain to this setting, except
for Thm. 3.2, which applies to a general semisimple group. The geometric methods of
this paper are general and we hope to analyze further higher rank examples in a
future paper.

Throughout the present paper we have tried to use "soft" techniques as a
substitute for explicit spectral expansions. However, there still seem to be
instances where the explicit spectral expansions are important. In a future paper
[23], joint with P. Michel, we shall combine ideas drawn from this paper with ideas
from Michel's paper [22]; in that paper, we shall make much more explicit use of
spectral decomposition.

We shall use the term "sparse equidistribution problems" to describe questions of
the following flavor: Suppose $Z_i \subset Y_i$ is a subset endowed with a measure
$\nu_i^Z$, and we would like to prove that the $\nu_i^Z$ are becoming
equidistributed. In other words, we wish to deduce the equidistribution of the
"sparse" subset $Z_i$ from the known equidistribution of $Y_i$. Examples of this
type of question are Shah's conjecture [32] (where the $Z_i$ are discrete subsets
of $Y_i$, a full horocycle orbit) as well as Michel's results on subsets of Heegner
points [22] (where the $Z_i$ are subsets of the $Y_i$, the set of all Heegner
points). The connection to period integrals is as follows: one can spectrally
expand the measure $\nu_i^Z$ in terms of the basis for $L^2(Y_i,\nu_i)$. Using our
results for periods along $Y_i$, it will sometimes be possible to deduce the
equidistribution of $\nu_i^Z$.

We now briefly summarize our results. 

(1) Sec. 3 considers the case where the $Y_i$ are orbits, or pieces of orbits, of
unipotent groups. The mixing flow is the horocycle flow along $Y_i$.

In Thm. 3.1 we show that certain sparse subsets of horocycles on compact
quotients of $\mathrm{SL}_2(\mathbb{R})$ become equidistributed. This is progress
towards a conjecture of N. Shah. In Thm. 3.2 we give a fairly general bound (in
the context of an arbitrary semisimple group) on the Fourier coefficients of
automorphic forms. In the case of $G = \mathrm{SL}_2(\mathbb{R})$ it recovers
results of Good and Sarnak [23], which resolved a problem of Selberg. The present
proof is more direct, avoiding in particular the triple product bounds for
eigenfunctions.

(2) Sec. 4 considers the case when $G = \mathrm{PGL}_2(F \otimes \mathbb{R})$, where
$F$ is a number field, and $\Gamma$ is a congruence subgroup thereof. The $Y_i$ are
a sequence of closed diagonal $G$-orbits on
$\Gamma\backslash G \times \Gamma\backslash G$. The mixing flow (after lifting to
the adeles) is the diagonal action of $\mathrm{PGL}_2(\mathbb{A}_{F,f})$, where
$\mathbb{A}_{F,f}$ is the ring of finite adeles of $F$.

Prop. 4.1 and Prop. 4.2 give period bounds in this context. We refer to Prop. 4.1
as a subconvex bound for the triple product period, for the reason that it should
in fact be equivalent to subconvexity for the triple product L-function, in the
level aspect as one factor varies, but the necessary computation of p-adic
integrals (Hypothesis 11.1) has apparently not yet been done in sufficient
generality. In Thm. 5.1 it is shown that these results yield subconvex bounds, in
the level aspect, for standard and Rankin-Selberg L-functions attached to
$\mathrm{PGL}_2$.

The results on standard and Rankin-Selberg L-functions generalize results of
Duke-Friedlander-Iwaniec and Kowalski-Michel-VanderKam from the case
$F = \mathbb{Q}$ (see footnote 1). The third result, concerning subconvexity of the
triple product period in the level aspect, was not known even over $\mathbb{Q}$;
however, Bernstein and Reznikov have shown subconvexity for the triple product
period in the eigenvalue aspect.

(3) Sec. 6 considers the case when $Y_i$ is a certain family of noncompact torus
orbits on $\Gamma\backslash G$, where $(\Gamma, G)$ is as in Sec. 4. (In fact, the
$Y_i$ are obtained by taking a fixed noncompact torus orbit, and translating by a
p-adic unipotent, where p varies.) The mixing flow is the action of the adelic
points of the torus.

We establish in Thm. 6.1 subconvexity for character twists of GL(2) in the level
aspect. This was established for $F = \mathbb{Q}$ by Duke-Friedlander-Iwaniec, and
the special case where $F$ is totally real and the form holomorphic at all infinite
places was treated by Cogdell, Piatetski-Shapiro and Sarnak. In particular, 6.2
gives a subconvex bound for Grössencharacter L-functions over $F$, in the level
aspect; this was known over $\mathbb{Q}$ by work of Burgess, and some special cases
were known in the general case.

(4) In Sec. 7 we consider the case where $Y_i$ is a (union of) compact torus orbits
on $\Gamma\backslash G$, where $(\Gamma, G)$ are as in Sec. 4. The equidistribution
of such $Y_i$ will amount to the equidistribution of Heegner points, and we deduce
it from Thm. 6.1 in Thm. 7.1. This result generalizes work of Duke over
$\mathbb{Q}$ and was proven, conditionally on GRH, by Zhang, Cohen, and
Ullmo-Clozel (independently). The present work makes this result unconditional.

Applying mixing properties of the adelic torus flow, we obtain in Thm. 7.2, under
a condition of splitting of enough small primes, the equidistribution of certain
sparse subsets of Heegner points. In the case $F = \mathbb{Q}$, an unconditional
result of this nature is due to Michel (see footnote 2).



1 We have not attempted to address the issue of varying the central character.
This, in a sense, is the most subtle point, as is shown by Michel's recent work on
Rankin-Selberg convolutions. Our aim in the present paper has been to show that one
can derive a coherent theory for $\mathrm{PGL}_2$ from the triple product bound of
Prop. 4.1. The case of varying central character will be discussed in a future
paper with Michel.

2 Our method is different from Michel's: we do not deduce our result from results
on Rankin-Selberg convolutions, and indeed it is possible to deduce a subconvexity
result from ours. However, there seem to be some curious parallels between the
methods. In fact, the method of Thm. 7.2 is even more closely related - as Michel
has pointed out to me - to the work [10] of Duke, Friedlander and Iwaniec. In that
paper they amplify class group L-functions but obtain only a conditional result for
precisely the same reason that Thm. 7.2 fails to be unconditional, namely, one
cannot guarantee unconditionally the existence of enough small split primes.

In the context of L-functions, one pleasing feature of the present method is
that it is geometric: it proceeds not via Fourier coefficients but via the integral
representation. In practice, this means that there is no difference between Maass
and holomorphic forms, nor between $\mathbb{Q}$ and an arbitrary base field.
Moreover, we do not make use of either the trace formula or the Kuznetsov formula;
indeed, we make no explicit use of families.

The recent work of Bernstein-Reznikov [2] is of a similar flavor. They establish a
"subconvex" bound for the triple product when the eigenvalue of one factor varies, 
whereas we have treated the case where the level of one factor varies. Their method 
is also geometric in nature, and moreover their result applies to a nonarithmetic 
group. By contrast, the level aspect question is not well-posed if one leaves the 
arithmetic setting. 

Throughout the paper we have not attempted to optimize the results. The input 
to our method is an equidistribution result. As far as possible we have tried to 
establish these results by relatively "geometric" methods, deriving in the end from 
the mixing properties of a certain flow. Of course, it is in many contexts better to 
use spectral methods, but this would involve departing from the geometric method 
that is intended to be the central theme of this paper. As remarked, we will pursue 
such "spectral" approaches in a forthcoming paper with P. Michel [23]; some of the 
results of this have been discussed in 

Finally, implicit in various parts of the paper is "adelic analysis", i.e. the analytic
theory of functions on adelic quotients, in the quantitative sense needed for analytic 
number theory. There seems to be considerable scope to develop this theory fully. 

1.2. Other applications. The method of this paper has other applications that are
not elaborated in the body of the paper; we discuss some of them here.

There are other subconvexity results that are naturally approached by the same
method: for instance, a subconvex estimate for $L(\pi, \tfrac12 + it)$ where $t$
varies and $\pi$ is a fixed cuspidal representation of GL(2) over a number field
$F$. In such a context it is natural to use the fact that the horocycle flow is
(quantifiably) weakly $k$-mixing, for certain $k > 1$; the use of this higher order
mixing is closely related to Weyl's "successive squaring" approach to
$\zeta(1/2 + it)$. Of course, this particular instance of subconvexity is
approachable by standard methods also; an intriguing question in the subconvexity
context is how to combine the present methods with those such as
Bernstein-Reznikov.

There are certain applications to effective equidistribution theorems: for
instance, it is also possible to establish some new effective cases of Ratner's
theorem by the same ideas, see Rem. 3.1. The question of giving such "nontrivial"
cases was raised by Margulis in his talk at the American Institute of Mathematics,
June 2004. Unfortunately, these new cases are rather artificial.

One can give certain analytic applications: let $\Gamma$ be a cocompact subgroup of
$\mathrm{SL}(2,\mathbb{R})$, and let
$\pi \subset L^2(\Gamma\backslash\mathrm{SL}(2,\mathbb{R}))$ be an irreducible
$\mathrm{SL}(2,\mathbb{R})$-subrepresentation. For $m \in \mathbb{Z}$, let $e_m$ be
the $m$th weight vector in $\pi$, if defined; i.e. a vector which transforms under
the character of $\mathrm{SO}(2)$ of weight $m$. We normalize it (up to a complex
scalar of absolute value 1) by requiring that $\|e_m\|_{L^2} = 1$. Bernstein and
Reznikov proved the bound $\|e_m\|_{L^\infty} \ll (1 + |m|)^{1/2}$, and asked
[2, Remark 2.5(4)] if any improvement of the exponent 1/2 is possible. It is quite
easy to deduce from Lem. 3.1 such a bound; indeed, the analytic behavior of the
$e_m$, as $|m| \to \infty$, is connected with the long time behavior of the
horocycle flow in the same fashion that the analytic behavior of Laplacian
eigenfunctions is connected to the long time behavior of the geodesic flow. In the
time during which this paper was being revised for submission, Reznikov has proven
independently a result of this type. Since the result he obtains is most likely
sharper than that obtained by the technique indicated above, we will not pursue
this further, noting only that an advantage of the method we have indicated above
is that it is likely to generalize to higher rank.

Moving slightly away from the main subject of the present paper, the idea of
using equidistribution theorems to produce mean value results for L-functions seems
capable of application in a variety of settings. In particular, equidistribution
results are readily available on GL(n), owing to Ratner's work, whereas trace
formulae are extremely unwieldy for $n > 2$. It would be interesting to see what
mean-value statements can be deduced from Ratner-type equidistribution results.

Historically, one application of such results has been to nonvanishing results;
here the most spectacular results (e.g. [34]) have been achieved through the
so-called mollifier technique. It would be quite interesting to understand if there
is a geometric interpretation of the mollifier technique.

1.3. Discussion of method: equidistribution, mixing, and periods. We now turn to a
discussion of the specifics of the method used in this paper. This method itself is
quite easy to describe. It consists in essence of two simple steps (see (1.2) and
(1.3) below).

We also remark that the discussion that follows is a relatively faithful rendition 
of the method of the paper. The body of this paper does not really utilize any new 
ideas beyond the ones indicated below. Most of the bulk consists of the technical 
details necessary to connect periods with other objects of interest (e.g. equidistri- 
bution questions or L-functions), as well as setting up the machinery to quantify 
some standard equidistribution results. As much as possible, we have tried to give 
a self-contained treatment of all these technical details in Sections II II 

We hope the ensuing discussion serves as a unifying thread for the rest of the
paper. We explain the method first in an abstract setting (Sec. 1.3.1). We then
explain (Sec. 1.3.2 and 1.3.3) these ideas in a more down-to-earth fashion,
emphasizing the parallel with the analytic techniques for studying L-functions.
Finally, Sec. 1.3.4 illustrates these ideas in a simple example - that of Fourier
coefficients of modular forms.

1.3.1. Abstract setting. Let $G_2 \subset G_1$ be locally compact groups,
$\Gamma \subset G_1$ a lattice, $X = \Gamma\backslash G_1$. Let $x_i \in X$ and put
$Y_i = x_i G_2$. We shall suppose that there exists a $G_2$-invariant probability
measure $\nu_i$ on $Y_i$. (This does not precisely cover all the contexts we
consider - at some points we will consider $Y_i$ which are "long pieces" of a
$G_2$-orbit rather than a single $G_2$-orbit, but the ideas in that case will be
identical to those discussed here.)

Let $f$ be a function on $X$ and $\psi_i$ a function on $Y_i$ such that
$\int_{Y_i} |\psi_i|^2\,d\nu_i = 1$. We will give a bound for the period
$\int_{Y_i} f\psi_i\,d\nu_i$.

In words, the idea will be to find certain correlations between the values of
$\psi_i$ at different points, and then show that the values of $f$ at these same
points are "uncorrelated," in some quantifiable sense. Putting these together will
show that the period must be small. The "hard" ingredient here is some version of
the spectral gap, i.e. quantitative mixing, which is what will establish the
"uncorrelated-ness" property of $f$.

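We will suppose that there exists $\sigma$, a measure on $G_2$, such that

\[ \psi_i \star \sigma = \lambda_i \psi_i, \tag{1.1} \]

for some $\lambda_i \in \mathbb{C}$. Here $\star\,\sigma$ denotes the action of
$\sigma$ by right convolution. Let $\check\sigma$ be the image of $\sigma$ by the
involution $g \mapsto g^{-1}$ of $G_2$. Then

\[
\Bigl|\int_{Y_i} f\psi_i\,d\nu_i\Bigr|^2
= |\lambda_i|^{-2}\Bigl|\int_{Y_i} f\cdot(\psi_i\star\sigma)\,d\nu_i\Bigr|^2
= |\lambda_i|^{-2}\Bigl|\int_{Y_i} (f\star\check\sigma)\cdot\psi_i\,d\nu_i\Bigr|^2
\le |\lambda_i|^{-2}\int_{Y_i} |f\star\check\sigma|^2\,d\nu_i, \tag{1.2}
\]

where we have applied Cauchy-Schwarz at the final step. Now, we are assuming
that the $Y_i$ are becoming equidistributed, and so $\nu_i \to \nu$, the
$G_1$-invariant measure on $\Gamma\backslash G_1$. Thus

\[
\int_{Y_i} |f\star\check\sigma|^2\,d\nu_i
\approx \int_X |f\star\check\sigma|^2\,d\nu
= \int_{g,g'\in G_2} \langle gg'^{-1}\cdot f, f\rangle_{L^2(X)}\,d\sigma(g)\,d\sigma(g'), \tag{1.3}
\]

where $gg'^{-1}\cdot f$ denotes the right translate of $f$ by $gg'^{-1}$.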

If the $G_2$-action on $X$ is mixing in a quantifiable way - i.e., one has strong
bounds on the decay of matrix coefficients - one obtains good upper bounds on the
right-hand side of (1.3); in combination with (1.2) this gives an upper bound for
the period $|\int_{Y_i} f\psi_i\,d\nu_i|$.
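
The two steps can be chained into a single inequality. The following LaTeX fragment is only a sketch: the symbol $E_i$ for the equidistribution error is our own shorthand and does not appear in the text.

```latex
% Sketch: chaining (1.2) and (1.3). E_i is a hypothetical name for the
% equidistribution error; it is not notation used in the paper.
\begin{align*}
E_i &:= \int_{Y_i} |f \star \check\sigma|^2 \, d\nu_i
        - \int_X |f \star \check\sigma|^2 \, d\nu, \\
\Bigl| \int_{Y_i} f \psi_i \, d\nu_i \Bigr|^2
 &\le |\lambda_i|^{-2} \Bigl( \int_{g, g' \in G_2}
   \langle g g'^{-1} \cdot f, f \rangle_{L^2(X)} \, d\sigma(g) \, d\sigma(g')
   + E_i \Bigr).
\end{align*}
```

Read this way, the period is small whenever $|\lambda_i|^2$ dominates both the double integral (controlled by mixing) and $E_i$ (controlled by quantitative equidistribution).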

The strength of the information required about the mixing varies. In the cases
we study where $G_2$ is amenable, any nontrivial information will suffice. In the
one case where $G_2$ is semisimple, a strong bound towards Ramanujan is needed. For
instance, in the case of triple products, we need any improvement of the bound that
the $p$th Hecke eigenvalue of a cusp form on GL(2) is bounded in absolute value by
$p^{1/4} + p^{-1/4}$. (In this normalization, the trivial bound is
$p^{1/2} + p^{-1/2}$.)

In the rest of this paper, we shall merely apply this argument many times, with
various different choices for $\Gamma$, $G_1$, $G_2$. The part of the argument
which will vary is quantifying the equidistribution of the $\nu_i$, i.e. keeping
track of the error in the first approximation of (1.3). Thus we make heavy use of
Sobolev norms (Sec. 8), which are an efficient method of bounding this error.

In each instance, the proof of the equidistribution result $\nu_i \to \nu$ will
be rather straightforward, except for the result of Sec. 7. The equidistribution
result needed for the proof of Thm. 7.2 is essentially equivalent to the
subconvexity result proved in Sec. 6. A rather striking point is that a similar
logical dependence (although manifested very differently) is present in the work of
Michel. The meaning of this is unclear to the author.

In certain specific cases, the above technique is quite familiar. When $G_2$ is a
one-parameter real group, the above argument is quite closely related to standard
techniques of analytic number theory (see footnote 3). On the other hand, when
$G_2$ is an adelic group, and $\sigma$ a measure on $G_2$ that corresponds to the
action of Hecke operators (this is carried out in Sec. 4 for instance), the above
argument will be essentially "amplification" in the sense of Friedlander-Iwaniec
[12].

3 For example, in certain contexts when $G_2$ is abelian, one can push this method
further by squaring multiple times, that is to say, considering
$|\int f\psi_i\,d\nu_i|^4$, $|\int f\psi_i\,d\nu_i|^8$ and so forth. In this
context, one replaces the mixing property of the $G_2$ action with results about
higher order mixing of the $G_2$-flow. Although we will not carry this out in the
present paper, this seems rather closely connected to Weyl's proof of subconvexity
for $\zeta(1/2 + it)$.

In the following two sections, we shall attempt to explain more colloquially the
main idea that is at work here, and also discuss how the method described above
fits into the framework of analytic number theory. Modern proofs of subconvexity,
following the path-breaking work of Friedlander-Iwaniec [12], have roughly speaking
consisted of a mean-value theorem and an amplification step. We shall discuss how
the proof indicated above may be viewed as geometrizing this strategy, where the
mean-value step is replaced by an equidistribution theorem, and the amplification
step is controlled using mixing.

Note, in particular, that in the work of Friedlander-Iwaniec, families of
L-functions play a central role, whereas the method above has in a certain sense
eliminated the family. Although in the discussion below we rephrase matters so as
to make clear the connection with the work of Friedlander-Iwaniec, it seems that
from the perspective of the present paper the phrasing in terms of families is
rather artificial.

1.3.2. Connection with analytic number theory: Equidistribution, and mean-value
theorem for periods. Follow the notations of the previous section. We choose an
orthonormal basis $\{\psi_{i,j}\}_{j=1}^{\infty}$ for $L^2(Y_i, \nu_i)$ so that
$\psi_{i,1} = \psi_i$.

By Plancherel's formula,
$\sum_{j=1}^{\infty} |\int f\psi_{i,j}\,d\nu_i|^2 = \int |f|^2\,d\nu_i$. Since
$\nu_i \to \nu$ weakly, and we are holding $f$ fixed, it follows that:

\[
\sum_{j=1}^{\infty} \Bigl|\int_{Y_i} f\psi_{i,j}\,d\nu_i\Bigr|^2
\longrightarrow \int_X |f|^2\,d\nu \tag{1.4}
\]

as $i \to \infty$. Thus the equidistribution property of $\nu_i$ underlies a
mean-value theorem for the $Y_i$-periods.

In many cases involving automorphic forms, the periods will essentially be special
values of L-functions and (1.4) amounts to a mean-value theorem for L-functions.
This is fairly well-known; for example, the mean-value theorem
$\int_{-T}^{T} |\zeta(1/2+it)|^4\,dt \sim T\log(T)^4$ is rather closely connected
with the equidistribution properties of the cycle
$\{(1 + i/T)x, x \in \mathbb{R}\}$, when projected to
$\mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}$. A more striking example is
Vatsal's use of equidistribution to prove nonvanishing results [38]. In general, it
seems that there are many interesting mean value theorems for L-functions that are
connected to equidistribution results.

In any case, (1.4) is not unrelated to the standard methods of obtaining such
results; however, its primary advantage is that it is often technically much
simpler, for example when working over a number field.

1.3.3. Connection with analytic number theory (II): Mixing, and bounds for a single
period. We now wish to pass from (1.4) to nontrivial upper bounds for a single
period. It is clear that (1.4) implies at once - by omitting all terms but one -
that $|\int f\psi_{i,j}\,d\nu_i| \lesssim \|f\|_{L^2(X)}$; we shall refer to an
improvement of this bound as nontrivial. It is evident that one must have some
further information about $\{\psi_{i,j}\}$ in order to do this; otherwise one could
simply take $\psi_{i,1}$ to be a multiple of $f|_{Y_i}$.

In the context of analytic number theory, this is often carried out by "shortening
the family," that is to say: proving a sharp mean-value theorem of the form of
(1.4), but over some subfamily of $\{\psi_{i,j}\}$; then omitting all terms but
$\psi_{i,1} = \psi_i$ will often give a nontrivial upper bound. In the work of
Friedlander-Iwaniec, a weighted mean-value theorem is derived, which has the same
effect as shortening the family.

Such a weighted mean-value theorem is also implicit in our context. Following
the notation of Sec. 1.3.1, suppose that there is a fixed measure $\sigma$ on $G_2$
such that for all $i,j$, we have
$\psi_{i,j} \star \sigma = \lambda_{i,j}\psi_{i,j}$ (some
$\lambda_{i,j} \in \mathbb{C}$). Then, by Plancherel's formula, and using the fact
$\nu_i \to \nu$, we conclude:

\[
\sum_{j} |\lambda_{i,j}|^2 \Bigl|\int_{Y_i} f\psi_{i,j}\,d\nu_i\Bigr|^2
= \int_{Y_i} |f\star\check\sigma|^2\,d\nu_i
\longrightarrow \int_X |f\star\check\sigma|^2\,d\nu. \tag{1.5}
\]

This gives a weighted mean value theorem, which for appropriate choices of
$\sigma$ amounts to shortening the effective range of summation in (1.4). Moreover,
the mixing of the $G_2$-flow bounds the right hand side of (1.5). In this phrasing,
it becomes clear that the measure $\sigma$ has played the role of an "amplifier"
and the orthonormal basis for $L^2(Y_i, \nu_i)$ has played the role of the family.
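
To see the amplifier at work, one can drop every term but $j = 1$ from the weighted sum. The fragment below is a sketch under the assumption that the weighted identity comes from Plancherel applied to $f \star \check\sigma$, together with the adjointness $\int f\,(\psi\star\sigma)\,d\nu_i = \int (f\star\check\sigma)\,\psi\,d\nu_i$ used in (1.2).

```latex
% Sketch: positivity lets us discard all terms except j = 1.
\begin{align*}
|\lambda_{i,1}|^2 \Bigl| \int_{Y_i} f \psi_i \, d\nu_i \Bigr|^2
 &\le \sum_{j} |\lambda_{i,j}|^2 \Bigl| \int_{Y_i} f \psi_{i,j} \, d\nu_i \Bigr|^2
  = \int_{Y_i} |f \star \check\sigma|^2 \, d\nu_i, \\
\Bigl| \int_{Y_i} f \psi_i \, d\nu_i \Bigr|^2
 &\le |\lambda_{i,1}|^{-2} \int_{Y_i} |f \star \check\sigma|^2 \, d\nu_i .
\end{align*}
```

This recovers the bound (1.2): a measure $\sigma$ for which $|\lambda_{i,1}|$ is large relative to the other $|\lambda_{i,j}|$ weights the sum towards the period of interest.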

Having now explained the method in an abstract context and indicated its equiv- 
alence with other methods, we now indicate more informally the source of cancel- 
lation in periods that is at the center of our results. 

In many natural situations, one obtains a basis for $L^2(Y_i,\nu_i)$ by
diagonalizing a geometrically defined algebra of operators on $Y_i$. The result of
this process is that the functions $\{\psi_j\}$ exhibit correlations between their
values at different points of $Y$. For instance (for example when $G_2$ is
semisimple), it often will occur that there is a correspondence $C : Y \to Y$ such
that the values of each $\psi_j$ at $P \in Y$ and at the collection of points
$C(P)$ are correlated in some way. On the other hand (and we shall now speak quite
imprecisely) if the correspondence $C$ "extends" to a correspondence
$C : X \to X$ one can often show, using mixing properties of $C$, that the values
of $f$ at $P$ and $C(P)$ will be uncorrelated, at least if $P$ is chosen at random
w.r.t. the uniform measure on $X$.

However, since the $Y_i$ are becoming equidistributed, it amounts to almost the
same thing to choose $P$ at random w.r.t. $\nu_i$ and w.r.t. the uniform measure on
$X$. Thus, for $\nu_i$-typical $P \in Y_i$, the values of $f$ at $P$ and $C(P)$ are
uncorrelated, whereas the values of $\psi_j$ at $P$ and $C(P)$ are correlated. One
can then play these phenomena against each other to obtain cancellation in the
period integral $\int f\psi_j\,d\nu_i$.

1.3.4. A concrete example. We shall now discuss how to bound Fourier coefficients
of a modular form by the methods just described. Although the material below
is essentially redone - with $\mathrm{SL}(2,\mathbb{R})$ replaced by a general
group - in Sec. 3.2, the example below was very important in motivating the
author's intuition, and it seems worthwhile to include it in the introduction.

Let $\Gamma \subset \mathrm{SL}(2,\mathbb{R})$ be a lattice containing the element
$\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$. Let $f(z)$ be a holomorphic form of
weight 2 w.r.t. $\Gamma$, which we write in a Fourier expansion
$f(z) = \sum_{n=1}^{\infty} a_n e^{2\pi i n z}$. Hecke proved the bound
$|a_n| \le Cn$, a bound which was only improved (for a general - possibly
nonarithmetic - $\Gamma$) much later, to $|a_n| \le Cn^{5/6}$, by A. Good. We shall
sketch a simple proof of a nontrivial bound $|a_n| \le Cn^{1-\delta}$ along the
lines just indicated; for further details, we refer the reader to Sec. 3.2, where
the procedure outlined is implemented for a general semisimple group.

We note that the ideas that will enter here are exactly those that will enter into
the proof of equidistribution of sparse subsets of horocycles (see Sec. 3.1), or
for the nontrivial bound for $L^\infty$ norms in the weight aspect that is
discussed in Sec. 1.2. The proof below also works for Maass forms (in that case the
result is due to Sarnak).

The Fourier expansion implies that

\[
a_n = e^{2\pi} \int_{x \in \mathbb{R}/\mathbb{Z}} f\Bigl(x + \frac{i}{n}\Bigr) e^{-2\pi i n x}\,dx. \tag{1.6}
\]

In words, the idea is as follows: the function $e^{-2\pi i n x}$ takes the same
values at $x, x + \frac{1}{n}, x + \frac{2}{n}, \dots$. On the other hand, the
values of the function $f$ at these points are (in a quantifiable sense)
uncorrelated, as we shall deduce from the mixing properties of the horocycle flow.
Playing these two properties against each other will yield an improvement of the
Hecke bound for $|a_n|$.

Let $\tilde f$ be the lift of $f$ to $\Gamma\backslash\mathrm{SL}(2,\mathbb{R})$;
that is to say,

\[
\tilde f : \begin{pmatrix} a & b \\ c & d \end{pmatrix} \mapsto
f\Bigl(\frac{ai+b}{ci+d}\Bigr)(ci+d)^{-2}.
\]

Let $x_n = \Gamma \begin{pmatrix} n^{-1/2} & 0 \\ 0 & n^{1/2} \end{pmatrix}$, and
put $n(t) = \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}$. Then the definitions
show that $\tilde f(x_n n(t)) = n^{-1} f\bigl(\frac{t+i}{n}\bigr)$; consequently,
we see that

\[
a_n = e^{2\pi} \int_0^n \tilde f(x_n n(t))\, e^{-2\pi i t}\,dt. \tag{1.7}
\]

(1.7) expresses the $n$th Fourier coefficient of $f$ as the integral of $\tilde f$
over a closed horocycle of length $n$. Moreover, (1.7) falls into the pattern of
Sec. 1.3.1 with $G_1 = \mathrm{SL}_2(\mathbb{R})$,
$G_2 = \{n(t) : t \in \mathbb{R}\}$, $Y_n = \{x_n n(t) : t \in \mathbb{R}\}$, and
$\psi_n : Y_n \to \mathbb{C}$ the function given by
$x_n n(t) \mapsto e^{-2\pi i t}$. The fact that the $Y_n$ are becoming
equidistributed amounts to the "equidistribution of low horocycles." In the
language of Sec. 1.3.1 we will take $\sigma$ to be the measure on
$G_2 \cong \mathbb{R}$ that is a sum of point masses $\delta_i$, for integers
$i = 1, \dots, K$. We now carry out the procedure of Sec. 1.3.1 in an explicit
fashion in the paragraphs that follow.
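
For this choice of $\sigma$ the eigenvalue relation (1.1) can be checked by hand. The fragment below is a sketch; it assumes the convention $(\psi \star \sigma)(x) = \int \psi(xg)\,d\sigma(g)$ for right convolution, which the text does not spell out, and writes $k$ for the summation index to avoid a clash with $i = \sqrt{-1}$.

```latex
% Sketch: psi_n is an eigenfunction of convolution with
% sigma = delta_1 + ... + delta_K, with eigenvalue K.
\begin{align*}
(\psi_n \star \sigma)(x_n n(t))
 &= \sum_{k=1}^{K} \psi_n\bigl(x_n n(t+k)\bigr)
  = \sum_{k=1}^{K} e^{-2\pi i (t+k)}
  = K \, e^{-2\pi i t}
  = K \, \psi_n(x_n n(t)),
\end{align*}
% so that lambda_n = K in the notation of (1.1).
```

The eigenvalue grows with the number of point masses, which is the source of the factor $K^{-2}$ in the bounds that follow.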

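Let $T$ be the operation of right translation by $n(1)$ on
$C^\infty(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))$: that is to say, for
$F \in C^\infty(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))$, we put
$TF(g) = F(g\,n(1))$.

The value of the right-hand side of (1.7) remains unchanged if we replace
$\tilde f$ by $T\tilde f$; consequently, for any integer $K \ge 1$, we have

\[
a_n = \frac{e^{2\pi}}{K} \int_0^n \Bigl(\sum_{i=0}^{K-1} T^i \tilde f\Bigr)(x_n n(t))\, e^{-2\pi i t}\,dt.
\]

Applying the Cauchy-Schwarz inequality we deduce that

\[
|a_n|^2 \le \frac{e^{4\pi} n}{K^2} \int_{t=0}^{n} \Bigl|\sum_{i=0}^{K-1} T^i \tilde f(x_n n(t))\Bigr|^2\,dt. \tag{1.8}
\]

4 Underlying this is the usual "van der Corput" trick: to bound
$\sum_{k=1}^{K} c_k$, it suffices to bound correlations
$\sum_{k=1}^{K} c_k \overline{c_{k+h}}$; in effect we apply this with
$c_k = f\bigl(\frac{k+i}{n}\bigr)$, $K = n$.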
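
For reference, one elementary form of the van der Corput inequality can be derived in three lines. This is a standard version stated as a sketch, not the precise form used by the author; the shift length $H \ge 1$ is our own auxiliary integer parameter, and we set $c_k = 0$ for $k$ outside $\{1, \dots, K\}$.

```latex
% Sketch of a van der Corput-type inequality.
% Convention (assumption): c_k = 0 for k outside {1, ..., K}; H >= 1 is an integer.
\begin{align*}
H \sum_{k=1}^{K} c_k
 &= \sum_{m=2-H}^{K} \; \sum_{h=0}^{H-1} c_{m+h}, \\
H^2 \Bigl| \sum_{k=1}^{K} c_k \Bigr|^2
 &\le (K+H-1) \sum_{m} \Bigl| \sum_{h=0}^{H-1} c_{m+h} \Bigr|^2
 && \text{(Cauchy-Schwarz in } m\text{)} \\
 &= (K+H-1) \sum_{|l| < H} (H - |l|) \sum_{k} c_{k+l} \, \overline{c_k}.
\end{align*}
```

The term $l = 0$ is the trivial contribution $\sum_k |c_k|^2$; any saving comes from showing that the shifted correlations with $l \neq 0$ are small, which is the role played by mixing in the main text.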

We now come to the equidistribution part of the argument. The equidistribution
of long closed horocycles asserts that the closed horocycle
$\{x + iy : 0 \le x \le 1\}$ becomes equidistributed in
$\Gamma\backslash\mathbb{H}$ as $y \to 0$. Quantitatively, for any
$F \in C^\infty(\Gamma\backslash\mathbb{H})$, we have

\[
\Bigl|\int_0^1 F(x + iy)\,dx - \int_{\Gamma\backslash\mathbb{H}} F\Bigr| \le C_F\, y^{\delta}, \tag{1.9}
\]

for some $C_F$ depending on $F$, and some $\delta > 0$ depending only on $\Gamma$.
This assertion, originally proved by Sarnak by spectral methods, can be deduced
quite easily from the mixing properties of the geodesic flow; this is done, in a
somewhat more general context, in Lem. 9.6.

We note that - as a special case of the discussion in Sec. 1.3.2 - the
equidistribution statement (1.9) above reflects a mean-value theorem for periods.
Indeed, if one applies it to $F = y^2|f|^2$, one deduces the asymptotic for
$\sum_{n < X} |a_n|^2$.

In any case, what will be more useful is the version of (1.9) that is lifted to
$\Gamma\backslash\mathrm{SL}_2(\mathbb{R})$. This asserts that for any
$F \in C^\infty(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))$, we have

\[
\Bigl|\frac{1}{n}\int_{t=0}^{n} F(x_n n(t))\,dt - \int_{\Gamma\backslash\mathrm{SL}_2(\mathbb{R})} F(g)\,dg\Bigr| \le C_F\, n^{-\delta}, \tag{1.10}
\]

where $\delta$ is an explicit constant depending only on $\Gamma$, and $C_F$ is a
constant depending on $F$.

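From (1.8) and (1.10) we conclude that

\[
|a_n|^2 \le \frac{e^{4\pi} n^2}{K^2} \Bigl( \Bigl\|\sum_{i=0}^{K-1} T^i \tilde f\Bigr\|^2_{L^2(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))} + C_{f,K}\, n^{-\delta} \Bigr). \tag{1.11}
\]

On the other hand, the explicit derivation of (1.10) shows that $C_F$ may be
bounded by a Sobolev norm of $F$, and consequently the constant $C_{f,K}$ that
appears in (1.11) is bounded by $O_f(K^A)$ for some $A > 0$. Thus

\[
|a_n|^2 \ll_f n^2 K^{-2} \Bigl( \Bigl\|\sum_{i=0}^{K-1} T^i \tilde f\Bigr\|^2_{L^2(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))} + K^A n^{-\delta} \Bigr).
\]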

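We now use the fact that the horocycle flow is mixing, in a quantifiable way.
This amounts to the assertion that there is an explicit $\delta' > 0$ and a
constant $C'_f$ such that, for $i \in \mathbb{Z}$,
$|\langle T^i \tilde f, \tilde f\rangle| \le C'_f (1 + |i|)^{-\delta'}$. It follows
easily that

\[
\Bigl\|\sum_{i=0}^{K-1} T^i \tilde f\Bigr\|^2_{L^2} \ll_f K^{2-\delta'}.
\]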
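
The step from the decay of matrix coefficients to the bound on the norm of the averaged function can be written out as follows. This is a sketch under the additional assumption $0 < \delta' < 1$ (one may always decrease $\delta'$), using that $T$ acts unitarily on $L^2(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))$.

```latex
% Sketch: expanding the square and summing the mixing bound.
% Assumption (ours): 0 < delta' < 1.
\begin{align*}
\Bigl\| \sum_{i=0}^{K-1} T^i \tilde f \Bigr\|_{L^2}^2
 &= \sum_{i,j=0}^{K-1} \langle T^{i-j} \tilde f, \tilde f \rangle
 && \text{($T$ is unitary on $L^2$)} \\
 &\le C'_f \sum_{|h| < K} (K - |h|)\,(1+|h|)^{-\delta'} \\
 &\le 2\, C'_f \, K \sum_{m=1}^{K} m^{-\delta'}
  \;\le\; \frac{2\, C'_f}{1-\delta'} \, K^{2-\delta'} .
\end{align*}
```

The trivial bound is $K^2 \|\tilde f\|^2$, so mixing saves a factor of $K^{\delta'}$.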

We conclude that $|a_n| \ll n\,(K^{-\delta'/2} + K^{A/2-1} n^{-\delta/2})$. Taking
$K$ to be a sufficiently small power of $n$, we conclude that $a_n$ is bounded by
$n^{1-\delta''}$ for some $\delta'' > 0$ depending only on $\Gamma$.
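
To make the choice of $K$ explicit, one can balance the two terms. The fragment below is a sketch under our own assumptions: $A > 2$ (which may be arranged by enlarging $A$), and $K = n^{\kappa}$ with rounding to an integer ignored.

```latex
% Sketch: balancing the two terms in the bound for |a_n|.
% Assumptions (ours): A > 2 and K = n^{\kappa}, ignoring rounding.
\begin{align*}
|a_n| &\ll_f n^{\,1 - \kappa\delta'/2} + n^{\,1 + \kappa(A/2 - 1) - \delta/2}, \\
\kappa &= \frac{\delta}{A - 2 + \delta'}
 \quad\Longrightarrow\quad
 |a_n| \ll_f n^{1-\delta''},
 \qquad \delta'' = \frac{\delta\,\delta'}{2\,(A - 2 + \delta')} .
\end{align*}
```

With this choice both exponents coincide, so neither the mixing term nor the equidistribution term dominates.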

Clearly $\delta''$ depends only on the spectral gap of
$\Gamma\backslash\mathrm{SL}_2(\mathbb{R})$. Of course, this dependence does not
arise in the "spectral" methods. It can be removed in the above method, but this
seems to require some extra input, e.g. the finite-dimensionality of the space of
functionals on an irreducible $\mathrm{SL}_2(\mathbb{R})$-representation that are
invariant under the subgroup $\{n(t) : t \in \mathbb{R}\}$.

1.3.5. Two other viewpoints on the method of 1.3.4. There are two other viewpoints
which might be helpful in thinking about Section 1.3.4. Neither of these viewpoints
literally generalizes to the other situations we consider (e.g. triple products),
but both may be helpful for intuition.

(1) The first is based on the following simple principle: suppose that $T$ is
a measure-preserving transformation of the probability space $(Y, \nu)$, and
that $T$ is ergodic. If $\mu_1, \mu_2$ are two $T$-invariant probability measures
with average $\frac{\mu_1 + \mu_2}{2} = \nu$, then $\mu_1 = \mu_2 = \nu$; this
follows because $\nu$ is an extreme point of the convex set of $T$-invariant
probability measures. More generally, given any family of probability measures
averaging to $\nu$, they must almost all equal $\nu$.

We will apply this to
$Y = \mathrm{SL}_2(\mathbb{Z})\backslash\mathrm{SL}_2(\mathbb{R})$ and $T$ the
operation of translation by $n(1)$.

Let $n$ be large; for $t \in \mathbb{R}/\mathbb{Z}$, let $\mu_t$ be the probability
measure that corresponds to normalized counting measure on
$\{x_n n(t + k) : k \in \mathbb{Z}, 0 \le k < n\}$. Here notation is as prior to
(1.7).

Then $\int_0^1 \mu_t\,dt$ is the measure on the closed horocycle
$\{x_n n(t) : 0 \le t \le n\}$. Thus the family of measures $\mu_t$ averages to the
measure on a long closed horocycle which, as we remarked earlier (see (1.10)),
approximates the $\mathrm{SL}_2(\mathbb{R})$-invariant measure $dg$ on
$\Gamma\backslash\mathrm{SL}_2(\mathbb{R})$. But this latter measure $dg$ is
ergodic w.r.t. $T$. So applying a more quantitative form of the principle discussed
above shows that, for almost all $t \in [0, 1]$, $\mu_t$ must be close to $dg$. It
is simple to see that one can use this to deduce bounds for the Fourier
coefficients, via (1.7).

(2) We will phrase the second rather imprecisely. Consider (1.7). Our strategy
of proof can be rephrased as:

The function $t \mapsto f(x_n n(t))$ is weak-mixing, whereas the function $t \mapsto
e^{2\pi i t}$ is periodic, and a weak-mixing function cannot correlate with a
periodic function.

To explain this statement, we need to explain what it means for a
function on the real line to be weak-mixing. Consider instead the case of a
function $h : \mathbb{Z} \to \mathbb{R}$. Furstenberg's correspondence principle asserts that
one can (loosely speaking) associate to this a dynamical system $(Y, \nu, T)$ in
such a way that (again loosely speaking) $h$ arises by sampling a function
on $Y$ along a generic trajectory $y_0, T(y_0), T^2(y_0), \dots$. We then say that
$h$ is weak-mixing if the system $(Y, \nu, T)$ is so. The fact that our function
$t \mapsto f(x_n n(t))$ (say, when restricted to integer times) is weak-mixing follows
from the equidistribution of long closed horocycles together with the fact
that the horocycle flow is, itself, mixing.

For more on this point of view, see e.g. [36, Section 4] and [36, Lemma
5.2] for a version of the statement that weak-mixing functions cannot
correlate with periodic ones.
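
The non-quantitative principle behind viewpoint (1) admits a short direct verification. The following sketch assumes in addition that $T$ is invertible, which holds in the application (translation by $n(1)$); the density $h$ is a name introduced here for illustration.

```latex
% Claim: if $\nu$ is $T$-ergodic and $\nu = \tfrac12(\mu_1 + \mu_2)$ with
% $\mu_1, \mu_2$ $T$-invariant probability measures, then $\mu_1 = \mu_2 = \nu$.
\begin{enumerate}
  \item Since $\mu_1 \le 2\nu$, the measure $\mu_1$ is absolutely continuous with
        respect to $\nu$: $d\mu_1 = h \, d\nu$ with $0 \le h \le 2$ ($\nu$-a.e.).
  \item Both $\mu_1$ and $\nu$ are $T$-invariant and $T$ is invertible, so
        $h \circ T^{-1}$ is also a density of $\mu_1$ with respect to $\nu$.
        By uniqueness of the Radon--Nikodym derivative, $h \circ T = h$ ($\nu$-a.e.).
  \item Ergodicity of $\nu$ forces $h$ to be $\nu$-a.e. constant, and
        $\int_Y h \, d\nu = \mu_1(Y) = 1$ gives $h = 1$. Hence $\mu_1 = \nu$,
        and then $\mu_2 = 2\nu - \mu_1 = \nu$.
\end{enumerate}
```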

1.4. Connection to existing methods. The following comments pertain to the 
results of the present paper that concern subconvex bounds for L-functions. As we 
have emphasized above, the methods presented here are, upon examination, seen to 
be closely related to existing methods: in particular, "Sarnak's spectral method,"
which gave the only hitherto known instance of subconvexity over a base other than $\mathbb{Q}$.

Indeed, as we have already indicated, the equidistribution step of our method can 
be seen as the geometric version of a mean value theorem, and the rest of the method 
can be seen as an amplification step (or "shortening the family"). Nevertheless, the
key features of the present method are that it is essentially geometric (in that it 
avoids Fourier coefficients) and adelic (which allows us to import much from the 
modern theory of automorphic forms); it also does not use families in any explicit 
way. Once the notation is established - admittedly a nontrivial overhead - the 
method allows for very considerable technical simplification. 

It is perhaps also noteworthy that the method given here does not require any 
exponential decay information for triple products. Although such exponential decay 
information is central only to subconvexity in the eigenvalue aspect, it has thus far 
entered as a technical device even in treatments of the level aspect. 

In a sense, the present method bears the same relation to existing methods as 
adelic methods do to classical methods in the theory of automorphic forms. The 
classical situation has the advantage of concreteness, and whatever can be carried 
out in the adelic setting can be (in principle) carried out in the classical setting. 
However there is often a considerable technical and conceptual advantage in working 
adelically. 

As we have discussed, the connection between equidistribution results and mean- 
value theorems for periods - implicitly exploited throughout this paper - appears 
in the work of V. Vatsal. 

D. Hejhal considered ideas similar to those of Sec. 3.2 in the context of proving
bounds towards Fourier coefficients; see [13]. In the language of this paper, his
method used a measure $\sigma$ (notation of Sec. 1.3) with much larger support, and
consequently he was unable to get unconditional results.

Finally, as was remarked in Sec. 1.1, the main result of Sec. 4.1 is the analogue
in the level aspect of a recent result of Bernstein-Reznikov; they establish a
"subconvex bound" on triple products as the eigenvalue of one factor varies. Their
methods also are geometric, avoiding the use of Fourier coefficients.

1.5. Acknowledgements. This paper grew out of my proof of Thm. 3.1. The
original proof was significantly more complicated, and I am indebted to Elon
Lindenstrauss for his insistence that Thm. 3.1 should amount to nothing more than
equidistribution and mixing. It was thus his intuition that led to a simplification of
the proof and an important step in my understanding. The idea that the methods
for Thm. 3.1 might be applicable in a more general setting arose during
conversations with Andreas Strömbergsson, who also made many valuable suggestions about
an early version of this paper. I thank them both for their significant contributions.

I am very grateful to Gergely Harcos and Philippe Michel for their encouragement 
of this project. Philippe read carefully an early draft of this paper and pointed out 
many points where the argument and results could be significantly improved. I am 
also grateful to Peter Sarnak, from whom I learned much of what I know about 
this subject. 

I have also benefited from several conversations with Joseph Bernstein and Andre
Reznikov. I thank them for their generosity in sharing and discussing their elegant ideas.

I have learned many of the methods that appear here from the work of others. 
I mention in particular Peter Sarnak's paper [2H], which uses the idea of changing
the test vector; the Friedlander-Iwaniec idea of amplification and the geometric
version of it that appears in Bourgain-Lindenstrauss [3]; and the recent work of
Bernstein-Reznikov [1], in particular their elegant use of Sobolev norms.

This paper suffered a considerable delay before submission. I would like to 
thank Philippe Michel for his encouragement and insistence that it be revised and 
submitted, without which the delay would have likely been considerably longer. I 
also thank Nicolas Bergeron and Marina Ratner for comments that improved the 
exposition and correctness. 

The ideas of this paper were worked out during the workshop "Emerging appli- 
cations of measure rigidity," AIM, San Francisco and at the Isaac Newton Institute. 

I was supported by the Clay Mathematics Institute during much of the writing of 
the paper, and I thank them for their generous support. I also thank the Institute 
for Advanced Study for providing excellent working conditions during the academic 
year 2005-2006. I was also partially supported by NSF grants DMS-0111298 and 
DMS-0245606. 

1.6. Structure of paper. The logical structure of this paper is as follows: Sec. 
2 introduces all necessary notation. The heart of the paper are Sec. (unipotent
periods), Sec. 21 (the triple product period), Sections EH and [7| (torus periods). The 
remaining Sections 151- 11 H are of a technical nature, proving various technical results 
required in the main text; at a first reading (or even later) they should perhaps be 
referred to only as necessary. 

The two examples that best convey the flavor of the paper are Theorem 3.1 and
Prop. 4.1. The proofs of these results are relatively self-contained, and we advise
that the reader start with them.

2. Notation. 

2.1. General notation. We use the symbol $\ll$ as is standard in analytic number
theory: namely, $A \ll B$ means that there exists a constant $c$ such that $A \le cB$.
The notation $A \ll_{f,g,h} B$ means that the constant $c$ may depend on the quantities
$f, g, h$; the notation $A \ll_{\epsilon} B$ or $A \ll_{\varepsilon} B$ will mean, unless otherwise indicated, that
the stated bound holds for all $\epsilon$ or $\varepsilon > 0$. In general, we will never explicate the
dependence of implicit constants on the number field over which we work; and, by
an abuse of terminology, we will sometimes use the phrase "absolute constant" to
mean a constant that depends only on this number field.

If $Z$ is a space we denote by $\delta_z$ the point measure at $z \in Z$, i.e. $\delta_z(f) = f(z)$ for
$f$ a continuous function on $Z$.

Now let $Z$ be a right $G$-space. For $f$ a function on $Z$ and $g \in G$, we write $g \cdot f$
for the right translate of $f$ by $g$, i.e. $g \cdot f(z) = f(zg)$. If $\mu$ is a measure on $Z$, we
define the translate $g \cdot \mu$ by the rule $g \cdot \mu(g \cdot f) = \mu(f)$. In particular, if $\mu = \delta_z$ is
the point mass at $z \in Z$, then $g \cdot \mu = \delta_{zg^{-1}}$ is the point mass at $zg^{-1}$.

If $\sigma$ is a compactly supported measure on $G$, we set $f \star \sigma = \int (g \cdot f) \, d\sigma(g)$,
i.e. $f \star \sigma(z) = \int_G f(zg) \, d\sigma(g)$. In particular, if $\delta_{g_0}$ is the point-mass at $g_0$, then
$f \star \delta_{g_0} = g_0 \cdot f$ is the right translate of $f$ by $g_0$.
If $\sigma_1, \sigma_2$ are two compactly supported measures on $G$, we define the convolution
$\sigma_1 \star \sigma_2$ to be the pushforward to $G$ of $\sigma_1 \times \sigma_2$ on $G \times G$, under the multiplication map
$(g_1, g_2) \in G \times G \mapsto g_1 g_2$. Notations as above, one has the (somewhat unfortunate)
compatibility relation $(f \star \sigma_2) \star \sigma_1 = f \star (\sigma_1 \star \sigma_2)$.
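
The order reversal in the compatibility relation can be checked by unwinding the definitions. The following is a routine verification, assuming only the conventions $f \star \sigma(z) = \int_G f(zg)\,d\sigma(g)$ and $g \cdot f(z) = f(zg)$ stated above, together with Fubini for compactly supported measures.

```latex
\begin{align*}
\big( (f \star \sigma_2) \star \sigma_1 \big)(z)
  &= \int_G (f \star \sigma_2)(z g_1) \, d\sigma_1(g_1)
   = \int_G \int_G f(z g_1 g_2) \, d\sigma_2(g_2) \, d\sigma_1(g_1) \\
  &= \int_{G \times G} f(z g_1 g_2) \, d(\sigma_1 \times \sigma_2)(g_1, g_2)
   = \int_G f(z g) \, d(\sigma_1 \star \sigma_2)(g)
   = \big( f \star (\sigma_1 \star \sigma_2) \big)(z).
\end{align*}
% Special case of point masses: $\delta_{g_1} \star \delta_{g_2} = \delta_{g_1 g_2}$ and
% $g_1 \cdot (g_2 \cdot f)(z) = (g_2 \cdot f)(z g_1) = f(z g_1 g_2) = (g_1 g_2) \cdot f(z)$.
```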




For $\sigma$ a measure on a group $G$, we denote by $\check{\sigma}$ the image of $\sigma$ by the involution
$g \mapsto g^{-1}$, and by $\|\sigma\|$ the total variation of $\sigma$.

If $G$ is a Lie group, we denote by $\mathrm{Ad}(g)$ the endomorphism $X \mapsto gXg^{-1}$ of its
Lie algebra.

If $B \subset A$ is a finite index subgroup of the group $A$, then we denote by $[A : B]$
the index of $B$ in $A$.

If $h$ is an entire function, the notation $\int_{\Re(s) = \sigma} h(s) \, ds$ denotes the line integral
along the line $\Re(s) = \sigma$ from $\sigma - i\infty$ to $\sigma + i\infty$. The notation $\int_{\Re(s) \gg 1} h(s) \, ds$ denotes
$\int_{\Re(s) = \sigma} h(s) \, ds$ for sufficiently large $\sigma$; in the contexts where we use this notation,
the answer will be constant when $\sigma$ is sufficiently large.

2.2. Classical modular forms. As usual $\mathbb{H}$ denotes the upper half plane, i.e.
$\{z \in \mathbb{C} : \mathrm{Im}(z) > 0\}$. It admits the usual action of $\mathrm{SL}(2, \mathbb{R})$ by fractional linear
transformations.

2.3. Number fields and associated notations. Let $F$ be a number field.
Throughout the paper we shall regard $F$ as fixed: that is to say, we allow implicit constants
in $\ll$, $\gg$ to depend on $F$ without explicit statement.

We set $F_\infty = F \otimes \mathbb{R}$, $\mathbb{A}_F$ the ring of adeles of $F$, $\mathbb{A}_{F,f}$ the ring of finite adeles.
Thus $\mathbb{A}_F = F_\infty \times \mathbb{A}_{F,f}$. We will fix once and for all an additive character $e_F :
\mathbb{A}_F / F \to \mathbb{C}$, and denote by $e_{F_v}$ the induced additive character of $F_v$.

For each place $v$ we have a canonical "absolute value" $x \mapsto |x|_v$ on $F_v^\times$, namely,
$|x|_v = \mathrm{meas}(xS)/\mathrm{meas}(S)$ for any (additive) Haar measure, meas, on $F_v$, and any subset $S$
of positive measure.

The same definition defines a character $\mathbb{A}_F^\times / F^\times \to \mathbb{R}_{>0}$, which we denote by
$\alpha \mapsto |\alpha|_{\mathbb{A}}$, or simply by $\alpha \mapsto |\alpha|$ if it is clear from context. We denote by $\mathbb{A}_F^1$
the subgroup of $\mathbb{A}_F^\times$ consisting of adeles of norm 1; then the quotient $\mathbb{A}_F^1 / F^\times$ is
compact. For a finite place $v$ of $F$, we denote by $\mathfrak{o}_{F_v}$ the maximal compact subring
of the completion $F_v$, by $\mathfrak{q}_v$ the maximal ideal of $\mathfrak{o}_{F_v}$, and by $q_v$ the cardinality of
the residue field.

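We shall generally denote ideals of $\mathfrak{o}_F$ by gothic letters $\mathfrak{f}, \mathfrak{q}, \mathfrak{n}$, etc. If $\mathfrak{f}$ is an
integral ideal of $\mathfrak{o}_F$, we set $N(\mathfrak{f}) := |\mathfrak{o}_F / \mathfrak{f}|$ to be its norm. Moreover, we shall denote
$\mathfrak{o}_{\mathfrak{f}} := \prod_{\mathfrak{q} | \mathfrak{f}} \mathfrak{o}_{\mathfrak{q}}$. Here $\mathfrak{o}_{\mathfrak{q}}$ denotes the completed ring, not the localized ring, i.e. $\mathfrak{o}_{\mathfrak{f}}$ is
the inverse limit of the rings $\mathfrak{o}_F / \mathfrak{f}^N$.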

We denote by $\mathfrak{d}$ the different of the character $e_F$, i.e. $\mathfrak{d}$ is a fractional ideal so
that $\mathfrak{d}_v^{-1}$ is, for every finite place $v$, the largest $\mathfrak{o}_{F_v}$-submodule of $F_v$ upon which
$e_F$ is trivial.

2.4. Adele groups and their function spaces. Let $G$ be a connected reductive
algebraic group over a number field $F$, and let $Z$ be its center. Denote by $\mathbb{A}_{F,f}$
the ring of finite adeles, and fix for each finite place $v$ a maximal open compact
subgroup $K_{v,G} \subset G(F_v)$ with the property that $K_{\max,G} := \prod_{v \text{ finite}} K_{v,G}$ is a
maximal open compact subgroup of $G(\mathbb{A}_{F,f})$. Put $\mathbf{X}_G = G(F) \backslash G(\mathbb{A}_F)$, $\mathbf{X}_{G,\mathrm{ad}} =
Z(\mathbb{A}_F) G(F) \backslash G(\mathbb{A}_F)$. Then $\mathbf{X}_{G,\mathrm{ad}}$ has finite volume with respect to any
$G(\mathbb{A}_F)$-invariant measure.

Let $\omega : Z(\mathbb{A}_F) \to \mathbb{C}^\times$ be a unitary character. We define the space $C^\infty_\omega(\mathbf{X}_G)$ to
be the space of functions on $\mathbf{X}_G$ whose stabilizer in $K_{\max,G}$ has finite index, which
transform under $Z(\mathbb{A}_F)$ by $\omega$, and so that the function $g \mapsto f(xg)$ is a $C^\infty$ function
of $g \in G(F_\infty)$, for each $x \in \mathbf{X}_G$. Similarly one defines an $L^2$-space $L^2_\omega(\mathbf{X}_G)$, or
simply $L^2$ if the central character $\omega$ is clear from context, by completing the space
of compactly supported functions in $C^\infty_\omega(\mathbf{X}_G)$ with respect to the Hilbert norm

$$\|f\|_{L^2} := \left( \int_{\mathbf{X}_{G,\mathrm{ad}}} |f(g)|^2 \, dg \right)^{1/2}.$$

For $\psi \in C^\infty_\omega(\mathbf{X}_G)$, we denote by $K_{v,\psi}$ the stabilizer of $\psi$ in $K_{v,G}$, and put

$$K_\psi = \prod_{v \text{ finite}} K_{v,\psi}. \tag{2.1}$$

We note that $K_\psi$ is, in general, a proper subgroup of the stabilizer of $\psi$ in $K_{\max,G}$.
For $\psi \in C^\infty_\omega(\mathbf{X}_G)$, we define the finite set of places $\mathrm{Supp}(\psi)$ to be those finite $v$
for which $K_{v,G}$ does not fix $\psi$, i.e.

$$\mathrm{Supp}(\psi) = \{v : K_{v,G} \ne K_{v,\psi}\}. \tag{2.2}$$

It is convenient to introduce some notions of "size" on $G(\mathbb{A}_F)$. Let $\mathfrak{g}$ be the
Lie algebra of $G(F_\infty)$. It is a finite dimensional real vector space; fix an arbitrary
norm on it. For $g_\infty \in G(F_\infty)$, we denote by $\|g_\infty\|$ the operator norm of the adjoint
endomorphism $\mathrm{Ad}(g_\infty^{-1}) : \mathfrak{g} \to \mathfrak{g}$. If $v$ is a finite place of $F$ and $g_v \in G(F_v)$,
we set $\|g_v\| = [K_{v,G}\, g_v K_{v,G} : K_{v,G}]$, i.e. the number of right $K_{v,G}$-cosets in
$K_{v,G}\, g_v K_{v,G}$. For $g_f = (g_v)_{v \text{ finite}} \in G(\mathbb{A}_{F,f})$ we put $\|g_f\| = \prod_v \|g_v\|$. Finally for
$g_{\mathbb{A}} = (g_\infty, g_f) \in G(F_\infty) \times G(\mathbb{A}_{F,f})$, set $\|g_{\mathbb{A}}\| = \|g_\infty\| \cdot \|g_f\|$.

We remark that $\|g_\infty\|$, $\|g_f\|$, $\|g_{\mathbb{A}}\|$ are all invariant by the center of $G$.
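
As an illustration of these definitions, here is a worked example in the special case $G = \mathrm{GL}(2)$ with $K_{v,G} = \mathrm{GL}_2(\mathfrak{o}_{F_v})$. This is only a sketch; the symbols $\varpi_v$ (a uniformizer of $F_v$), $q_v$ (the residue field cardinality) and $r \ge 1$ are chosen here for the example, and the archimedean estimate is stated for $F_v = \mathbb{R}$ and only up to constants depending on the chosen norm on $\mathfrak{g}$.

```latex
% Finite place: right cosets of K_v in K_v diag(\varpi_v^r, 1) K_v correspond to
% points of the projective line over $\mathfrak{o}_{F_v}/\varpi_v^r$, hence
\[
  \left\| \begin{pmatrix} \varpi_v^{r} & 0 \\ 0 & 1 \end{pmatrix} \right\|
  = \big| \mathbb{P}^1(\mathfrak{o}_{F_v} / \varpi_v^{r}) \big|
  = q_v^{\,r-1}(q_v + 1), \qquad r \ge 1 .
\]
% Central invariance: for a scalar matrix z, K_v (z g_v) K_v = z \, K_v g_v K_v,
% so the number of right cosets is unchanged: \| z g_v \| = \| g_v \|.
%
% Real place: Ad(g^{-1}) for g = diag(y, 1) scales the upper-right root vector
% by y^{-1}, the lower-left one by y, and fixes the diagonal, so
\[
  \left\| \begin{pmatrix} y & 0 \\ 0 & 1 \end{pmatrix} \right\|
  \asymp \max\big( |y|, |y|^{-1} \big), \qquad y \in \mathbb{R}^\times .
\]
```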

2.5. The groups G = GL(2) and G = PGL(2) and some of their subgroups. 
We will deal most often with the cases of $G = \mathrm{GL}(2)$ (resp. $G = \mathrm{PGL}(2)$). In that
setting we shall write $\mathbf{X}_{\mathrm{GL}(2)}$ (resp. $\mathbf{X}$) for $\mathbf{X}_G$.

We will make use of the following algebraic subgroups of GL2, which we will 
often also regard as algebraic subgroups of PGL2 in the obvious way: 

;)■ B -d :)■ z =(» 

If $R$ is any ring and $x \in R$, $y \in R^\times$, we denote$^5$

(2.3) »w=(; 1), ww=(i ;), a ( „)=(» ;), 

all elements of $\mathrm{GL}_2(R)$.

If $v$ is a place of $F$ and $x \in F_v$, $y \in F_v^\times$, we denote by $n_v(x)$ (resp. $a_v(y)$) the
element $n(x)$ (resp. $a(y)$) considered as an element of $\mathrm{GL}_2(\mathbb{A}_F)$ via the natural
inclusion $\mathrm{GL}_2(F_v) \hookrightarrow \mathrm{GL}_2(\mathbb{A}_F)$.

For each place $v$, we let $K_v$ be the standard maximal compact subgroup of
$\mathrm{GL}_2(F_v)$, i.e. $K_v$ is the stabilizer of the norm on $F_v^2$ given by
$\sqrt{|x|_v^{2/\deg(v)} + |y|_v^{2/\deg(v)}}$
if $v$ is archimedean, where $\deg(v)$ is the degree$^6$ of $F_v$ over $\mathbb{R}$; and $\max(|x_v|, |y_v|)$ if
$v$ is nonarchimedean. Thus, in particular, $K_v = \mathrm{GL}_2(\mathfrak{o}_{F_v})$ if $v$ is nonarchimedean.

$^5$ In Section II alone, we will use slightly different notation for $a(y)$ to accommodate the fact
that we deal with $\mathrm{SL}_2$ rather than $\mathrm{GL}_2$. We make the relevant notation clear in that section.

$^6$ Recall that $|x|_v$, for a complex place $v$ and $x \in F_v$, is the square of the usual absolute value
on $\mathbb{C}$!

We put $K_{\max} = \prod_{v \text{ finite}} K_v$. $K_v$ (respectively $K_{\max}$) is a maximal compact
subgroup of $\mathrm{GL}_2(F_v)$ (respectively $\mathrm{GL}_2(\mathbb{A}_{F,f})$), and (by projection) can also be
regarded as a maximal compact subgroup of $\mathrm{PGL}_2(F_v)$ (respectively $\mathrm{PGL}_2(\mathbb{A}_{F,f})$).
Similarly $K_{\max} \times \prod_{v \mid \infty} K_v$ is a maximal compact subgroup of $\mathrm{GL}_2(\mathbb{A}_F)$, and may also
be regarded as a maximal compact subgroup of $\mathrm{PGL}_2(\mathbb{A}_F)$.

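For $\mathfrak{q}$ a finite prime of $F$, we denote by $\varpi_{\mathfrak{q}} \in F_{\mathfrak{q}}$ a uniformizer, and by $[\varpi_{\mathfrak{q}}]$ the
element of $\mathbb{A}_F^\times$ that is the image of $\varpi_{\mathfrak{q}}$ under the natural inclusion $F_{\mathfrak{q}}^\times \hookrightarrow \mathbb{A}_F^\times$.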

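Let $\mathfrak{q}$ be a finite prime of $F$. It will be convenient to define certain open compact
subgroups of $K_{\mathfrak{q}}$. For each $e_{\mathfrak{q}} \ge 0$, we define $K[\mathfrak{q}^{e_{\mathfrak{q}}}] \subset K_{\mathfrak{q}}$ (resp. $K_0[\mathfrak{q}^{e_{\mathfrak{q}}}] \subset K_{\mathfrak{q}}$) to
be the kernel of $\mathrm{GL}_2(\mathfrak{o}_{\mathfrak{q}}) \to \mathrm{GL}_2(\mathfrak{o}_{\mathfrak{q}} / \varpi_{\mathfrak{q}}^{e_{\mathfrak{q}}})$ (resp. the preimage, under this
map, of the upper triangular matrices). Thus

$$K_0[\mathfrak{q}^{e_{\mathfrak{q}}}] = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} : a, b, d \in \mathfrak{o}_{\mathfrak{q}},\ c \in \mathfrak{q}^{e_{\mathfrak{q}}},\ ad - bc \in \mathfrak{o}_{\mathfrak{q}}^\times \right\}.$$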

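Now let $\mathfrak{f}$ be a fractional ideal, not necessarily prime, of $F$. Factorize $\mathfrak{f} = \prod_{\mathfrak{q}} \mathfrak{q}^{e_{\mathfrak{q}}}$
into prime ideals. We define elements $[\mathfrak{f}] \in \mathbb{A}_F^\times$, $a([\mathfrak{f}]), n([\mathfrak{f}]) \in \mathrm{GL}_2(\mathbb{A}_F)$ via:

$$[\mathfrak{f}] = \prod_{\mathfrak{q} | \mathfrak{f}} [\varpi_{\mathfrak{q}}]^{e_{\mathfrak{q}}}, \qquad a([\mathfrak{f}]) := \prod_{\mathfrak{q} | \mathfrak{f}} a_{\mathfrak{q}}(\varpi_{\mathfrak{q}}^{e_{\mathfrak{q}}}), \qquad n([\mathfrak{f}]) = \prod_{\mathfrak{q} | \mathfrak{f}} n_{\mathfrak{q}}(\varpi_{\mathfrak{q}}^{e_{\mathfrak{q}}}). \tag{2.4}$$

Suppose $\chi : \mathbb{A}_F^\times / F^\times \to \mathbb{C}^\times$ is a character. We define

$$\chi(\mathfrak{f}) = \begin{cases} 0, & \chi \text{ ramified at any place dividing } \mathfrak{f}, \\ \prod_{\mathfrak{q} | \mathfrak{f}} \chi(\varpi_{\mathfrak{q}})^{e_{\mathfrak{q}}}, & \text{else.} \end{cases}$$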

2.6. Measures. The choice of measure is not especially important, as we are only 
interested in upper bounds; thus, so long as we are consistent, the precise selection 
does not matter. We choose a "standard" set of measures here; at times in the text, 
especially when carrying out equidistribution arguments, it will be more convenient 
to use probability measures, and we will indicate when this is the case. 

We denote by $\mu_{\mathbf{X}}$ the $\mathrm{PGL}_2(\mathbb{A}_F)$-invariant probability measure on $\mathbf{X}$. We shall
sometimes simply denote it by $dx$.

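Let $v$ be a finite place of $F$. Unless explicitly stated otherwise, the measures
on $\mathrm{GL}_2(F_v)$, $\mathrm{PGL}_2(F_v)$, $F_v$ and $F_v^\times$ are the Haar measures which assign $\mathrm{GL}_2(\mathfrak{o}_{F_v})$
(resp. $\mathrm{PGL}_2(\mathfrak{o}_{F_v})$, $\mathfrak{o}_{F_v}$, $\mathfrak{o}_{F_v}^\times$) the total mass 1.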

For $v$ archimedean, endow $F_v$ with a multiple of Lebesgue measure $c_v \, dx$, where
the constants $c_v$ are fixed arbitrarily in such a way that the induced product measure
on $F_\infty$ satisfies $\mathrm{vol}(F_\infty / \mathfrak{o}_F) = 1$; equivalently, the product measure on $\mathbb{A}_F$ satisfies
$\mathrm{vol}(\mathbb{A}_F / F) = 1$. In particular, this product measure on $\mathbb{A}_F$ is self-dual with respect
to $e_F$. We endow $F_v^\times$ with the measure $d^\times x = dx / |x|_v$, where $dx$ is Lebesgue measure.

These choices induce a Haar measure on $N(F_v)$, by means of the identification
$x \mapsto n(x)$; similarly, the identifications $(y, y') \mapsto a(y) a'(y')$ and $y \mapsto z(y)$ induce
Haar measures on $A(F_v)$ and $Z(F_v)$. Equip $K_v$ with the measure of mass 1, and give
$\mathrm{GL}_2(F_v)$ the measure arising from the Iwasawa decomposition $N(F_v) \times A(F_v) \times K_v$.
Equip $\mathrm{PGL}_2(F_v) = \mathrm{GL}_2(F_v) / Z(F_v)$ with the "quotient" measure.

We then take the measures on $\mathrm{GL}_2(\mathbb{A}_F)$, $\mathrm{PGL}_2(\mathbb{A}_F)$, $\mathbb{A}_F$, $\mathbb{A}_F^\times$ to be the
corresponding product measures.

The measure on any discrete group (e.g. $\mathrm{PGL}_2(F)$, considered as a subgroup of
$\mathrm{PGL}_2(\mathbb{A}_F)$) will be counting measure.



Usually (indeed, unless otherwise specified) we shall use the $\mathrm{PGL}_2(\mathbb{A}_F)$-invariant
probability measure on $\mathbf{X} = \mathrm{PGL}_2(F) \backslash \mathrm{PGL}_2(\mathbb{A}_F)$. This does not coincide with
the quotient measure induced from $\mathrm{PGL}_2(\mathbb{A}_F)$, but they differ by some constant
depending only on $F$. On the few occasions when we use the latter
measure, we will indicate this.

2.7. Projection onto locally constant functions. For equidistribution ques- 
tions it is usually convenient to deal with the constant function and its orthogonal 
complement separately. Some minor complications arise in our case since the
ambient spaces are not connected. In fact: the space $C^\infty(\mathbf{X}_G)$ is a direct limit of
function spaces $C^\infty(\mathbf{X}_G / K)$ where $K \subset K_{\max,G}$ has finite index. Unless $G$ is
simply connected, the manifolds $\mathbf{X}_G / K$ need not be connected.

Of course, to deal with this, one can (if $G$ is semisimple) simply replace the
notion of constant function by locally constant function. However, in the general 
case of G reductive, matters are slightly complicated by the necessity of dealing 
with central characters. 

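Since we will only use this definition when $G$ is a product of GL(2)s, we restrict
ourselves to that setting. First suppose that $G = \mathrm{GL}(2)$. We define a projection
$\mathscr{P} : C^\infty_\omega(\mathbf{X}_{\mathrm{GL}(2)}) \to C^\infty_\omega(\mathbf{X}_{\mathrm{GL}(2)})$ via

$$\mathscr{P} f(x) = \int_{h \in \mathrm{SL}_2(F) \backslash \mathrm{SL}_2(\mathbb{A}_F)} f(hx) \, dh = \sum_{\chi^2 = \omega} \chi(x) \int_{\mathbf{X}} f(y) \overline{\chi(y)} \, dy, \tag{2.5}$$

where $dh$ is the $\mathrm{SL}_2(\mathbb{A}_F)$-invariant probability measure, $dy$ the $\mathrm{GL}_2(\mathbb{A}_F)$-invariant
probability measure on $\mathbf{X}$, $\chi$ ranges over characters of $\mathbb{A}_F^\times / F^\times$ with square $\omega$, $\chi(y)$ is
the function on $\mathbf{X}_{\mathrm{GL}(2)}$ defined by $g \mapsto \chi(\det(g))$, and the second equality is easily
verified. We note, in particular, that the $\chi$-sum is finite (any $\chi$ for which the
corresponding term is nonvanishing must be unramified outside $\mathrm{Supp}(f)$).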

Then $\|\mathscr{P} f\|_{L^\infty} \le \|f\|_{L^\infty}$, as is clear from the first equality of (2.5), and $\mathscr{P}$ is
a self-adjoint projection w.r.t. $L^2$, as is clear from the second equality. We say a
function $f$ is totally nondegenerate if $\mathscr{P} f = 0$.
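
The claim that the second expression in (2.5) defines a self-adjoint projection can be checked from orthogonality of characters. The following sketch reads (2.5) as $\mathscr{P} f = \sum_{\chi^2 = \omega} c_\chi(f)\, \chi$; the coefficient notation $c_\chi(f) := \int_{\mathbf{X}} f(y) \overline{\chi(y)}\,dy$ and the name $\eta$ are introduced here for illustration only.

```latex
% Orthogonality. If $\chi^2 = \chi'^2 = \omega$, then $\eta := \chi \overline{\chi'}$ is a
% quadratic character, so $g \mapsto \eta(\det g)$ is a function on $\mathbf{X}$. If
% $\eta \neq 1$, pick $g_0$ with $\eta(\det g_0) \neq 1$; right-invariance of $dy$ gives
\[
  \int_{\mathbf{X}} \eta(\det y) \, dy
  = \int_{\mathbf{X}} \eta(\det (y g_0)) \, dy
  = \eta(\det g_0) \int_{\mathbf{X}} \eta(\det y) \, dy
  \quad \Longrightarrow \quad
  \int_{\mathbf{X}} \chi(y) \overline{\chi'(y)} \, dy = \delta_{\chi, \chi'} .
\]
% Idempotence and self-adjointness then follow at once:
\begin{align*}
  c_{\chi'}(\mathscr{P} f)
    &= \sum_{\chi^2 = \omega} c_\chi(f) \int_{\mathbf{X}} \chi \, \overline{\chi'}
     = c_{\chi'}(f)
     \quad \Longrightarrow \quad \mathscr{P}^2 f = \mathscr{P} f, \\
  \langle \mathscr{P} f_1, f_2 \rangle
    &= \sum_{\chi^2 = \omega} c_\chi(f_1) \, \overline{c_\chi(f_2)}
     = \langle f_1, \mathscr{P} f_2 \rangle .
\end{align*}
```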

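If $G = \mathrm{GL}(2) \times \mathrm{GL}(2)$, and $\omega = (\omega_1, \omega_2)$ is a character of the center $Z(\mathbb{A}_F) =
\mathbb{A}_F^\times \times \mathbb{A}_F^\times$, we denote by $\mathscr{P}_1$ the operator on $C^\infty_\omega(\mathbf{X}_G)$ given by

$$\mathscr{P}_1 f(x_1, x_2) = \int_{h \in \mathrm{SL}_2(F) \backslash \mathrm{SL}_2(\mathbb{A}_F)} f(h x_1, x_2) \, dh = \sum_{\chi^2 = \omega_1} \chi(x_1) \int_{\mathbf{X}} f(y, x_2) \overline{\chi(y)} \, dy.$$

We define $\mathscr{P}_2$ similarly, interchanging the role of the first and second coordinate.
The operators $\mathscr{P}_j$ for $j = 1, 2$ commute, satisfy $\|\mathscr{P}_j f\|_{L^\infty} \le \|f\|_{L^\infty}$ and are
commuting self-adjoint projections on $L^2$. We say that a function $f$ is totally
nondegenerate if $\mathscr{P}_1 f = \mathscr{P}_2 f = 0$.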

Lemma 2.1. Let $v$ be a place of $F$. The projection $\mathscr{P}$ acts by the identity on the
subspace $W \subset L^2_\omega(\mathbf{X}_G)$ spanned by one-dimensional representations of $\mathrm{GL}_2(F_v)$
occurring in $L^2_\omega(\mathbf{X}_G)$.

Similarly, $\mathscr{P}_1$ (resp. $\mathscr{P}_2$) acts by the identity on the space $W_1$ (resp. $W_2$)
spanned by one-dimensional representations of $\mathrm{GL}_2(F_v)$ occurring in $L^2(\mathbf{X} \times \mathbf{X})$
for the action on the first (resp. second) factor.

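Proof. This follows from the spectral decomposition for GL(2). For instance, it is
known that the space $W$ is precisely the span of functions of the form $g \mapsto \chi(\det(g))$,
where $\chi$ ranges over characters of $\mathbb{A}_F^\times / F^\times$ satisfying $\chi^2 = \omega$. $\square$

2.8. Hecke operators and bounds towards the Ramanujan conjecture. Let
$\mathfrak{l}$ be a prime ideal of $\mathfrak{o}_F$ and $r$ an integer $\ge 1$. Let $F_{\mathfrak{l}}$ be the completion of $F$ at the
prime $\mathfrak{l}$. Take the Haar measure on $\mathrm{GL}_2(F_{\mathfrak{l}})$ so that it assigns mass 1 to $\mathrm{GL}_2(\mathfrak{o}_{F_{\mathfrak{l}}})$.
Define the measure $\mu^*_{\mathfrak{l}^r}$ on $\mathrm{GL}_2(F_{\mathfrak{l}})$ to be the restriction of Haar measure to the set
$\mathrm{GL}_2(\mathfrak{o}_{F_{\mathfrak{l}}}) \begin{pmatrix} \varpi_{\mathfrak{l}}^r & 0 \\ 0 & 1 \end{pmatrix} \mathrm{GL}_2(\mathfrak{o}_{F_{\mathfrak{l}}})$, so that the total mass of $\mu^*_{\mathfrak{l}^r}$ is $N(\mathfrak{l})^{r-1}(N(\mathfrak{l}) + 1)$.
Moreover, set

$$\mu_{\mathfrak{l}^r} = \frac{1}{N(\mathfrak{l})^{r/2}} \sum_{0 \le k \le r/2} \mu^*_{\mathfrak{l}^{r-2k}}, \qquad \overline{\mu}_{\mathfrak{l}^r} = \frac{\mu_{\mathfrak{l}^r}}{\|\mu_{\mathfrak{l}^r}\|}, \tag{2.6}$$

where $\| \cdot \|$ denotes total variation. Thus $\overline{\mu}_{\mathfrak{l}^r}$ is a probability measure. Via the
natural inclusion of $\mathrm{GL}_2(F_{\mathfrak{l}})$ in $\mathrm{GL}_2(\mathbb{A}_{F,f})$, we may regard $\mu_{\mathfrak{l}^r}$ as a compactly
supported measure on $\mathrm{GL}_2(\mathbb{A}_{F,f})$; by abuse of notation, we will not introduce a
different symbol for this measure. If $\mathfrak{n}$ is an integral ideal of $\mathfrak{o}_F$, factorize $\mathfrak{n} = \prod_i \mathfrak{l}_i^{r_i}$
and put $\mu_{\mathfrak{n}} = \prod_i \mu_{\mathfrak{l}_i^{r_i}}$, $\overline{\mu}_{\mathfrak{n}} = \prod_i \overline{\mu}_{\mathfrak{l}_i^{r_i}}$. Here $\prod$ is taken to mean convolution of measures
on $\mathrm{GL}_2(\mathbb{A}_{F,f})$.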

Convolution by $\mu_{\mathfrak{n}}$ on $L^2(\mathbf{X})$ corresponds to the $\mathfrak{n}$th Hecke operator; in this
normalization the Ramanujan conjecture corresponds to it having eigenvalues $\le 2$
in absolute value.

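The adelic measures $\mu_{\mathfrak{n}}$ satisfy the usual multiplication laws, appropriately
interpreted: if $\mathfrak{n}$ and $\mathfrak{m}$ are ideals, then

$$\int_{\mathrm{PGL}_2(\mathbb{A}_F)} h(x) \, d(\mu_{\mathfrak{n}} \star \mu_{\mathfrak{m}})(x) = \sum_{\mathfrak{d} | (\mathfrak{m}, \mathfrak{n})} \int_{\mathrm{PGL}_2(\mathbb{A}_F)} h(x) \, d\mu_{\mathfrak{n} \mathfrak{m} \mathfrak{d}^{-2}}(x), \tag{2.7}$$

whenever $h$ is a function on $\mathrm{PGL}_2(\mathbb{A}_F)$ that is invariant under $\mathrm{PGL}_2(\mathfrak{o}_v)$ for all
$v | \mathfrak{n} \mathfrak{m}$.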
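
As a sanity check on the normalizations, one can compare total masses on both sides of the multiplication law in the simplest case $\mathfrak{n} = \mathfrak{m} = \mathfrak{l}$ prime. This sketch assumes our reading of (2.6), namely $\mu_{\mathfrak{l}^r} = N(\mathfrak{l})^{-r/2} \sum_{0 \le k \le r/2} \mu^*_{\mathfrak{l}^{r-2k}}$, with $\mu^*_{\mathfrak{l}^0}$ the restriction of Haar measure to $\mathrm{GL}_2(\mathfrak{o}_{F_{\mathfrak{l}}})$ (mass 1); the abbreviation $N = N(\mathfrak{l})$ is ours.

```latex
% Total mass of a convolution of positive measures is the product of the masses.
\begin{align*}
  \|\mu_{\mathfrak{l}}\| &= N^{-1/2}(N + 1) = N^{1/2} + N^{-1/2}, \\
  \|\mu_{\mathfrak{l}} \star \mu_{\mathfrak{l}}\| &= \big( N^{1/2} + N^{-1/2} \big)^2 = N + 2 + N^{-1}, \\
  \|\mu_{\mathfrak{l}^2}\| + \|\mu_{(1)}\|
    &= N^{-1} \big( N(N+1) + 1 \big) + 1 = N + 2 + N^{-1} .
\end{align*}
% The two totals agree, as the case $\mathfrak{d} \in \{(1), \mathfrak{l}\}$ of (2.7) requires:
% $\mu_{\mathfrak{l}} \star \mu_{\mathfrak{l}} = \mu_{\mathfrak{l}^2} + \mu_{(1)}$ on suitably invariant functions.
```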

Definition 2.1. Let $\alpha$ be a bound towards Ramanujan for $\mathrm{GL}_2$ over $F$, i.e. $\alpha$ is so
that $\mu_{\mathfrak{l}}$ acts on any $\mathrm{PGL}_2(\mathfrak{o}_{F_{\mathfrak{l}}})$-invariant cuspidal eigenfunction by an eigenvalue
$\le N(\mathfrak{l})^{\alpha} + N(\mathfrak{l})^{-\alpha}$ in absolute value.

Thus $\alpha = 0$ corresponds to the Ramanujan conjecture, $\alpha = 1/2$ the trivial bound.
By work of Kim and Kim-Shahidi, we can take $\alpha = 3/26$. For our applications, any
value of $\alpha$ less than $1/4$ would suffice.

Note that we shall slightly vary this notation (but in a reasonably compatible
way) in Section 3.1 and Section 9.3.1. In those parts, we shall deal with a (not
necessarily arithmetic) quotient $\Gamma \backslash \mathrm{SL}_2(\mathbb{R})$, and $\alpha$ will denote a number so that
$L^2(\Gamma \backslash \mathrm{SL}_2(\mathbb{R}))$ does not contain any complementary series with parameter $> \alpha$.
(Here the complementary series is understood to be parameterized by $(0, 1/2)$.)
This is compatible with the above notation, however; e.g. if $\Gamma$ were a congruence
subgroup, $\alpha = 3/26$ would again be admissible.

2.9. Sobolev-type norms on real and adelic quotients. 

2.9.1. General comments. Let $M$ be a real manifold. Recall that the Sobolev norm
on $C^\infty(M)$ controls, roughly speaking, the $L^p$ norm of a function together with
the $L^p$ norms of certain derivatives. These norms will be tremendously useful
throughout the paper to control equidistribution rates. We shall use both the
relatively simple definition when $M = \Gamma \backslash G$ and an adelic variant.

First let us remark on the use of $L^p$-Sobolev norms for $p > 2$. This is solely
to do with noncompactness. If we were to deal only with compact quotients, then
the $L^2$-Sobolev theory would always suffice. However, in the noncompact case, the
$L^2$-Sobolev norms do not (e.g.) give good bounds on the size of a function high in a
cusp. There are, of course, various ways to rectify this; for example we could include
weights that measure the height into the cusp. We have chosen instead to use
$L^p$-norms with $p > 2$, which is technically very simple, but has some disadvantages
(e.g. it does not induce a Hilbert space structure).

Note that we will allow our seminorms and norms to take the value $\infty$. Thus a
seminorm on a complex vector space $V$ will be a function from $V$ to $\mathbb{R}_{\ge 0} \cup \{\infty\}$
satisfying

(1) $\|\lambda v\| = |\lambda| \|v\|$, for any $v \in V$ such that $\|v\| < \infty$;

(2) $\|v_1 + v_2\| \le \|v_1\| + \|v_2\|$ if both $\|v_1\|$ and $\|v_2\|$ are not infinite.

It is a norm if additionally $\|v\| = 0$ implies $v = 0$. Note that giving such a
seminorm on $V$ is equivalent to giving a subspace $V_f \subset V$ together with a
finite-valued seminorm on $V_f$. Indeed take $V_f = \{v \in V : \|v\| < \infty\}$, equipped with the
restriction of $\| \cdot \|$.

We remark that we do not require that our norms be complete. 



2.9.2. Non-adelic setting. Suppose $\Gamma \subset G$ is a lattice in a connected semisimple Lie
group. Fix for all time a basis $\mathcal{B}$ for the Lie algebra $\mathfrak{g}$ of $G$ and a norm $\| \cdot \|$ on $\mathfrak{g}$.
For $g \in G$, we denote by $\|g\|$ the operator norm of $\mathrm{Ad}(g^{-1}) : \mathfrak{g} \to \mathfrak{g}$, i.e. the map
$X \mapsto g^{-1} X g$.

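For $f \in C^\infty(\Gamma \backslash G)$, and $1 \le p \le \infty$, we put

$$S_{p,d}(f) = \sum_{\mathrm{ord}(\mathcal{D}) \le d} \|\mathcal{D} f\|_{L^p(\Gamma \backslash G)}. \tag{2.8}$$

Here $\mathcal{D}$ ranges over all monomials in $\mathcal{B}$ of order $\le d$, and $\mathcal{D}$ acts on $f$ by right
differentiation. (For example, $X \in \mathfrak{g}$ acts on $f$ via $Xf(g) = \frac{d}{dt}\big|_{t=0} f(g e^{tX})$.)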

Changing $\mathcal{B}$ only distorts $S_{p,d}$ by a bounded factor. (That is to say, if $S'_{p,d}$ is the
norm obtained by replacing $\mathcal{B}$ by another basis, then there are positive reals $c_1, c_2$,
possibly depending on $d$, such that $c_1 S_{p,d} \le S'_{p,d} \le c_2 S_{p,d}$.)

We will often use the following simple remark: Fix a Riemannian metric $d(\cdot, \cdot)$
on $G$ and suppose $g \in G$ belongs to some fixed compact set. Then, for $f \in
C^\infty(\Gamma \backslash G)$, $x \in \Gamma \backslash G$, we have $|f(xg) - f(x)| \ll S_{\infty,1}(f) \, d(g, 1)$. Indeed, we may
assume that $g$ is close to the identity and write $g = \exp(X)$, with $X \in \mathfrak{g}$; now apply
the mean value theorem to $t \mapsto f(x e^{tX})$.
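
The mean value step can be written out explicitly. In this sketch the coefficients $c_i$ of $X$ with respect to the fixed basis $\mathcal{B} = \{X_i\}$ are notation introduced here; the implied constants depend on $\mathcal{B}$, the norm on $\mathfrak{g}$ and the Riemannian metric, but not on $f$ or $x$.

```latex
\begin{align*}
  f(x e^{X}) - f(x)
    &= \int_0^1 \frac{d}{dt} f(x e^{tX}) \, dt
     = \int_0^1 (Xf)(x e^{tX}) \, dt, \\
  |f(x e^{X}) - f(x)|
    &\le \|Xf\|_{L^\infty}
     \le \sum_i |c_i| \, \|X_i f\|_{L^\infty}
     \ll \|X\| \, S_{\infty,1}(f)
     \qquad \Big( X = \sum_i c_i X_i \Big), \\
  \|X\| &\ll d(\exp X, 1) \quad \text{for $X$ in a small neighbourhood of $0 \in \mathfrak{g}$.}
\end{align*}
```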

Moreover, the following elementary properties are easily verified (we only need 
them in the case p = 00). 

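Lemma 2.2. Let $f_1, f_2 \in C^\infty(\Gamma \backslash G)$ and $g \in G$. Then

$$S_{\infty,d}(f_1 f_2) \ll_d S_{\infty,d}(f_1) \, S_{\infty,d}(f_2),$$

$$S_{\infty,d}(g \cdot f_1) \ll_d \|g\|^d \, S_{\infty,d}(f_1).$$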
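
To see where the factor $\|g\|^d$ in the second inequality comes from, it may help to look at the case $d = 1$. This is a sketch of that case only, using the conventions $g \cdot f(z) = f(zg)$ and $\|g\| = \|\mathrm{Ad}(g^{-1})\|$; the general case would iterate the same identity.

```latex
% For X in the Lie algebra, use  e^{tX} g = g \, e^{t \, \mathrm{Ad}(g^{-1}) X}:
\begin{align*}
  X(g \cdot f)(z)
    &= \frac{d}{dt}\Big|_{t=0} f(z e^{tX} g)
     = \frac{d}{dt}\Big|_{t=0} f\big( z g \, e^{t \, \mathrm{Ad}(g^{-1}) X} \big)
     = \big( g \cdot (\mathrm{Ad}(g^{-1}) X) f \big)(z), \\
  \|X(g \cdot f)\|_{L^\infty}
    &= \big\| (\mathrm{Ad}(g^{-1}) X) f \big\|_{L^\infty}
     \ll \|\mathrm{Ad}(g^{-1})\| \, \|X\| \, S_{\infty,1}(f)
     = \|g\| \, \|X\| \, S_{\infty,1}(f).
\end{align*}
% The zeroth-order term satisfies \| g \cdot f \|_{L^\infty} = \| f \|_{L^\infty}, which is
% harmless because \|g\| is bounded below: Ad(g^{-1}) has determinant 1 for G connected
% semisimple, so its spectral radius, and hence its operator norm, is at least 1.
```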

We remark that, in the case $p = 2$, the rule (2.8) also defines a system of Sobolev
norms on any unitary $G$-representation; the case discussed above corresponds to
the unitary representation $L^2(\Gamma \backslash G)$.

2.9.3. Adelic Sobolev norms. Let's first describe what the point is intended to be
(evidently there are many ways of implementing it, cf. Rem. 2.1). We would
like to put a norm on the adelic function space, suitable for controlling e.g. period
integrals. Consider $C^\infty_\omega(\mathbf{X}_G)$ in the case of $G = \mathrm{SL}_2$, $F = \mathbb{Q}$, $\omega = 1$ as a direct limit
of spaces $C^\infty(\Gamma_i \backslash \mathrm{SL}_2(\mathbb{R}))$, where $\Gamma_i$ ranges over some class of congruence subgroups
of $\Gamma_0 := \mathrm{SL}_2(\mathbb{Z})$. We equip each quotient $\Gamma_i \backslash \mathrm{SL}_2(\mathbb{R})$ with the $\mathrm{SL}_2(\mathbb{R})$-invariant
probability measure. Then, on each space $C^\infty(\Gamma_i \backslash \mathrm{SL}_2(\mathbb{R}))$ we have the norm $S_{p,d}$
defined in the previous section. On the other hand, typical bounds on automorphic
forms have an implicit dependence on the "level", i.e. the index $[\Gamma_0 : \Gamma_i]$; so one
would like to have a norm that increases with the level. The most naive candidate
is, fixing a real number $\beta > 0$, to define the "norm" of $f \in C^\infty(\Gamma_i \backslash \mathrm{SL}_2(\mathbb{R}))$ to be
$[\Gamma_0 : \Gamma_i]^\beta S_{p,d}(f)$. This unfortunately does not quite make sense when we pass to the
direct limit: however, we can "force it to make sense" by considering the maximal
norm on the direct limit whose restriction to each $C^\infty(\Gamma_i \backslash \mathrm{SL}_2(\mathbb{R}))$ is bounded above
by $[\Gamma_0 : \Gamma_i]^\beta S_{p,d}(f)$. This will suffice for our purposes.

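Let us formalize these ideas. In what follows we return to the setting of $G$
a reductive group over $F$. The adelic Sobolev norms will be a family of norms
on $C^\infty_\omega(\mathbf{X}_G)$ indexed by a triple $(p, d, \beta)$. The $d$ and $\beta$ indicate, approximately
speaking, how stringently one should "penalize" rapid variation at the infinite and
finite places respectively.

Let $p \ge 1$, $d \in \mathbb{N}$, $\beta \ge 0$. Fix a basis $\mathcal{B} = \{X_i\}$ for the real Lie algebra $\mathrm{Lie}(G(F_\infty))$.
Recalling the definition of $K_\psi$ from (2.1), we define the pre-Sobolev functions
$PS_{p,d,\beta}$ on $C^\infty_\omega(\mathbf{X}_G)$ via:

$$PS_{p,d,\beta}(\psi) = [K_{\max,G} : K_\psi]^\beta \sum_{\mathrm{ord}(\mathcal{D}) \le d} \|\mathcal{D} \psi\|_{L^p(\mathbf{X}_{G,\mathrm{ad}})} = \prod_{v \text{ finite}} [K_{v,G} : K_{v,\psi}]^\beta \sum_{\mathrm{ord}(\mathcal{D}) \le d} \|\mathcal{D} \psi\|_{L^p(\mathbf{X}_{G,\mathrm{ad}})}, \tag{2.10}$$

where the sum ranges over $\mathcal{D}$ that are monomials in $\mathcal{B}$ of order $\le d$.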

The function $PS_{p,d,\beta}$ does not satisfy the triangle inequality. We define the
$(p, d, \beta)$-Sobolev norm $S_{p,d,\beta}$ to be the maximal seminorm on $C^\infty_\omega(\mathbf{X}_G)$ satisfying
$S_{p,d,\beta}(\psi) \le PS_{p,d,\beta}(\psi)$. Explicitly,

$$S_{p,d,\beta}(\psi) = \inf \Big\{ \sum_{i=1}^n PS_{p,d,\beta}(\psi_i) : \sum_{i=1}^n \psi_i = \psi,\ \psi_i \in C^\infty_\omega(\mathbf{X}_G) \Big\}. \tag{2.11}$$

In fact, it is clear that the right-hand side of (2.11) defines a seminorm that is
dominated by $PS_{p,d,\beta}$ (take the collection $\{\psi_i\}$ to consist of $\{\psi\}$ alone); moreover,
it is evidently maximal in the class of such seminorms. Finally, as $PS_{p,d,\beta}(\psi) \ge
\|\psi\|_{L^p}$, the $S_{p,d,\beta}$ are in fact norms on $C^\infty_\omega(\mathbf{X}_G)$.
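
The two assertions in the last paragraph (the infimum is a seminorm, and it dominates the $L^p$ norm) can be verified in a few lines. The sketch below assumes $\beta \ge 0$, so that every index factor is at least 1, and uses that the sum over monomials includes the trivial monomial $\mathcal{D} = 1$; the decompositions $\psi = \sum_i \psi_i$, $\psi' = \sum_j \psi'_j$ are arbitrary.

```latex
\begin{align*}
  \text{(lower bound)} \quad
  \sum_{i=1}^n PS_{p,d,\beta}(\psi_i)
    &\ge \sum_{i=1}^n \|\psi_i\|_{L^p}
     \ge \Big\| \sum_{i=1}^n \psi_i \Big\|_{L^p}
     = \|\psi\|_{L^p}
     \quad \Longrightarrow \quad
     S_{p,d,\beta}(\psi) \ge \|\psi\|_{L^p}, \\
  \text{(triangle inequality)} \quad
  S_{p,d,\beta}(\psi + \psi')
    &\le \sum_{i} PS_{p,d,\beta}(\psi_i) + \sum_{j} PS_{p,d,\beta}(\psi'_j)
     \quad \text{for all decompositions,} \\
    &\Longrightarrow \quad
     S_{p,d,\beta}(\psi + \psi') \le S_{p,d,\beta}(\psi) + S_{p,d,\beta}(\psi').
\end{align*}
% In particular S_{p,d,\beta}(\psi) = 0 forces \psi = 0, which is the norm property.
```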

It will often be useful to omit the argument $\beta$ and set it to a "default" value of
$1/p$. We therefore define $S_{p,d} := S_{p,d,1/p}$, for $p \ne \infty$, and $S_{\infty,d} := S_{\infty,d,0}$.

Notational convention: We will very often have cause to bound linear
functionals $L$ on $C^\infty_\omega(\mathbf{X}_G)$ by Sobolev norms. In writing statements of the form
$|L(f)| \ll S_{p,d,\beta}(f)$, we will always allow the implicit constant to depend on $p$, $d$
and $\beta$ without explicitly saying so.

2.10. Adelic Sobolev norms - a slight generalization. The notations of this
section will only be required in Sec. We recommend it be omitted at a first 
reading. 

In the discussion at the start of Sec. 2.9.3, we did not address what class of
subgroups $\Gamma_i$ to consider (should we take all finite index subgroups of a fixed $\Gamma_0$ or
some subclass?). Implicitly, such a choice was made in defining the Sobolev norms
of the previous section. The Sobolev norms introduced in the previous section are
good for most of our purposes. However, roughly speaking, they have the following
defect: they only measure the index of a stabilizer of a function $f \in C^\infty_\omega(\mathbf{X}_G)$.

That this definition might lead to some peculiar results can be already seen in
the case $G = \mathrm{PGL}(2)$, $F = \mathbb{Q}$. Let $\chi_p$ be the character of $\mathbb{A}_{\mathbb{Q}}^\times / \mathbb{Q}^\times$ that corresponds
to the quadratic Dirichlet character of $\mathbb{Q}$ with conductor $p$, a prime number. Then
the function $g \mapsto \chi_p(\det(g))$ descends to a function $f$ on $\mathbf{X}$, and it is easy to check
that $[K_{\max} : K_f] = 2$, for any $p$. Thus the index of this stabilizer does not reflect
the conductor of the underlying representation (which, by any reasonable definition
of conductor, should grow as $p$ increases). In this section we shall introduce a slight
modification of the definitions which avoids this problem. (This problem would not
occur for SL(2).)

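This is a purely technical matter, and it seems there is much scope for giving
better and more natural definitions. We restrict ourselves to the case $G = \mathrm{GL}(n)$. For
a finite place $\mathfrak{q}$ and $m \ge 0$, we put $K[\mathfrak{q}^m] = \ker\big(\mathrm{GL}(n, \mathfrak{o}_{\mathfrak{q}}) \to \mathrm{GL}(n, \mathfrak{o}_{\mathfrak{q}} / \varpi_{\mathfrak{q}}^m \mathfrak{o}_{\mathfrak{q}})\big)$,
where $\varpi_{\mathfrak{q}}$ is a uniformizer in $F_{\mathfrak{q}}$. Now, for $\psi \in C^\infty_\omega(\mathbf{X}_G)$, put $K^*_{\mathfrak{q},\psi}$ to be the largest
subgroup $K[\mathfrak{q}^m]$ which stabilizes $\psi$, and put $K^*_\psi = \prod_{\mathfrak{q}} K^*_{\mathfrak{q},\psi}$.
We define the $*$-pre-Sobolev norm $PS^*_{p,d,\beta}$ by the rule

$$PS^*_{p,d,\beta}(\psi) = [K_{\max,G} : K^*_\psi]^\beta \sum_{\mathrm{ord}(\mathcal{D}) \le d} \|\mathcal{D} \psi\|_{L^p(\mathbf{X}_{G,\mathrm{ad}})}$$

and we define the $*$-Sobolev norm $S^*_{p,d,\beta}$ to be the maximal seminorm dominated
by $PS^*_{p,d,\beta}$. Clearly $S^*_{p,d,\beta} \ge S_{p,d,\beta}$.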

The eventual purpose of this is that S* d g (unlike S Pt d,p) will never be "too 
small" on an automorphic representation whose conductor is large. This can be 
quantified, although we do not do so in the present document. 

Remark 2.1. Evidently the definitions of this section and the previous are not the 
only "sensible" way of defining a notion of adelic Sobolev norms. The results of 
this paper do not require any more sophisticated definition, although this would 
certainly be of help in optimizing the results. 

However, it would be interesting to impose a system of Sobolev-type norms in a 
less ad hoc fashion. Moreover, it would be pleasant if the system of norms had nice 
interpolation properties (this often is very helpful for getting sharp results). For 
example it would be nice if, as one varied $\beta$, one got a family of interpolation spaces. 

We remark on a simple way of defining Hilbertian norms which seems (more) 
appropriate to the adelic context. Let $K[\mathfrak{q}^m]$ be as above, and let $E_{\mathfrak{q}^m}$ be the 
averaging projection onto the $K[\mathfrak{q}^m]$-fixed vectors, i.e. $E_{\mathfrak{q}^m}(v) = \int_{k \in K[\mathfrak{q}^m]} k \cdot v \, dk$, 
where the measure is the Haar probability measure. Then $e_{\mathfrak{q}^m} := E_{\mathfrak{q}^m} - E_{\mathfrak{q}^{m-1}}$ is a 
projection. If $\mathfrak{f} = \prod_{\mathfrak{q}} \mathfrak{q}^{m_{\mathfrak{q}}}$ is an arbitrary integral ideal, put $e_{\mathfrak{f}} := \prod_{\mathfrak{q}} e_{\mathfrak{q}^{m_{\mathfrak{q}}}}$. Now put 
$P(s) = \sum_{\mathfrak{f}} e_{\mathfrak{f}} \, N(\mathfrak{f})^{s}$. Then $f \mapsto \sum_{\mathrm{ord}(\mathcal{D}) \leq d} \|\mathcal{D} \cdot P(s) \cdot f\|_{L^2}$ defines a Hilbert norm 
which seems to have reasonably pleasant formal properties. In fact it is majorized 
(up to constants) by a norm of the type described above. 

J. Bernstein has a more canonical notion of norms on representation spaces of 
p-adic groups, and he has informed me that these norms have adelic analogues. I 
do not know the relation. The norms arising from his constructions are Hilbertian. 

2.11. Some properties and uses of the Sobolev norms. We briefly summarize 
certain results that will be used in the text. Detailed proofs are given in Sec. 8. 
For general $G$, we have: 

(2.12) $S_{p,d,\beta}(F_1 F_2) \ll_d S_{2p,d,\beta}(F_1) \, S_{2p,d,\beta}(F_2)$. 

(2.13) $S_{p,d,\beta}(g \cdot F) \ll \|g_\infty\|^{d} \, \|g_f\|^{\beta} \, S_{p,d,\beta}(F)$. 

(2.12), proved in Lem. 8.1, and (2.13), proved in Lem. 8.2, give some basic 
stability properties of Sobolev norms. 

Now we specialize to some results for GL(2) and PGL(2). Let $F \in C^\infty(X \times X)$, 
let $\mathfrak{q}$ be a prime ideal of $\mathfrak{o}_F$, and suppose $F$ is invariant by $PGL_2(\mathfrak{o}_{F_{\mathfrak{q}}}) \times PGL_2(\mathfrak{o}_{F_{\mathfrak{q}}})$. 
Then: 

(2.14) $\cdots \ll_\epsilon N(\mathfrak{q})^{\cdots} \, S_{p,d}(F)$. 

(2.14), proved in Lem. 9.8, quantifies Hecke equidistribution. To understand 
the relation, take $F$ to be a pure tensor: $F(x, y) = f_1(x) f_2(y)$. Then (2.14) in effect 
bounds the inner product $\langle T_{\mathfrak{q}} f_1, f_2 \rangle$, where $T_{\mathfrak{q}}$ is the Hecke operator corresponding 
to $\mathfrak{q}$. 

2.12. Cusp forms, L-functions and the analytic conductor. As a general 
remark on notation - and a mild abuse of notation - by cuspidal representation we 
shall always mean unitary cuspidal representation. This is automatic for PGL(2) 
but not for GL(2). 

2.12.1. L-functions. Let $\pi = \otimes_v \pi_v$ be an automorphic cuspidal representation of 
$GL(n)$ over $F$. We denote by $L_v(s, \pi_v)$ the local L-factor of the representation $\pi_v$; 
when it causes no confusion, we will sometimes abbreviate this to $L(s, \pi_v)$. 

We write $L(s, \pi) := \prod_{v \text{ finite}} L_v(s, \pi_v)$ for the (finite part of the) global L-function 
attached to $\pi$, and $\Lambda(s, \pi) := \prod_v L_v(s, \pi_v)$ for the (completed) L-function attached 
to $\pi$. 

2.12.2. The analytic conductor of Iwaniec-Sarnak. We recall the definition in the 
context where it will arise. Let $\pi = \otimes \pi_v$ be a cuspidal representation of $GL(n)$ over 
$F$. 

For each finite place $v$ we denote by $\mathrm{Cond}_v(\pi)$ the conductor, in the sense of 
Jacquet, Piatetski-Shapiro, and Shalika, of $\pi_v$; thus $\mathrm{Cond}_v(\pi) = q_v^{m_v}$, where $m_v$ is 
the smallest non-negative integer such that $\pi_v$ possesses a fixed vector under the 
subgroup of $GL_n(\mathfrak{o}_{F_v})$ consisting of matrices whose bottom row is congruent to 
$(0, 0, \ldots, 0, 1)$ modulo $\mathfrak{q}_v^{m_v}$. 

For each infinite place $v$, let $\Gamma_v(s) = \pi^{-s/2} \Gamma(s/2)$ or $(2\pi)^{-s} \Gamma(s)$ according to 
whether $v$ is real or complex respectively, and put $\deg(v) = [F_v : \mathbb{R}]$. Let $\mu_{j,v} \in \mathbb{C}$ 
satisfy $L(s, \pi_v) = \prod_j \Gamma_v(s + \mu_{j,v})$, and put $\mathrm{Cond}_v(\pi) = \prod_j (1 + |\mu_{j,v}|)^{\deg(v)}$. We then 
put $\mathrm{Cond}(\pi) = \prod_v \mathrm{Cond}_v(\pi)$ (this is within a constant factor of the Iwaniec-Sarnak 
definition). Moreover, we put $\mathrm{Cond}_\infty(\pi) = \prod_{v \text{ infinite}} \mathrm{Cond}_v(\pi)$ and $\mathrm{Cond}_f(\pi) = 
\prod_{v \text{ finite}} \mathrm{Cond}_v(\pi)$ (the "infinite" and "finite" parts of the conductor). 

We will occasionally refer to the "finite conductor" of $\pi$ as the ideal $\prod_v \mathfrak{q}_v^{m_v}$, 
where $\mathfrak{q}_v$ is the prime ideal corresponding to the finite place $v$; then $\mathrm{Cond}_f(\pi)$ is 
the norm of this ideal. Hopefully the distinction between the two usages will be 
clear from context. 

Remark 2.2. (Explication for GL(1) in the archimedean case) Let us be slightly 
more explicit in the case of a unitary character $\omega$ of $\mathbb{A}_F^\times/F^\times$. If $v$ is real, then 
there is $t \in \mathbb{R}$ such that $\omega(x) = |x|^{it}$ for $x > 0$; then $\mathrm{Cond}_v(\omega) \asymp (1 + |t|)$. If 
$v$ is complex, then there is $t \in \mathbb{R}$, $N \in \mathbb{Z}$ such that $\omega(re^{i\theta}) = |r|^{it} e^{iN\theta}$; then 
$\mathrm{Cond}_v(\omega) \asymp (1 + |t| + |N|)^2$. 

We can heuristically summarize this: in the real case, $\omega$ is approximately 
constant in a neighbourhood of the identity of size $\mathrm{Cond}_v(\omega)^{-1}$; in the complex case, 
$\omega$ is approximately constant in a disc around the identity of area $\mathrm{Cond}_v(\omega)^{-1}$. 
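
To make the archimedean part of the definition concrete, here is a minimal sketch that evaluates $\prod_j (1 + |\mu_{j,v}|)^{\deg(v)}$ for a given list of parameters. The function name and the choice to pass the parameters $\mu_{j,v}$ directly as Python complex numbers are our own assumptions; the code does not compute the parameters from a representation.

```python
def archimedean_conductor(mus, is_complex=False):
    """Product of (1 + |mu_j|)^deg(v) over the given parameters mu_j.

    deg(v) is 1 at a real place and 2 at a complex place.
    """
    degree = 2 if is_complex else 1
    conductor = 1.0
    for mu in mus:
        conductor *= (1.0 + abs(mu)) ** degree
    return conductor


if __name__ == "__main__":
    # Real place, character |x|^{it} with t = 3, i.e. a single parameter mu = 3i.
    print(archimedean_conductor([3j]))             # (1 + 3) = 4.0
    # Two parameters at a real place, as for GL(2).
    print(archimedean_conductor([1j, -1j]))        # (1 + 1)^2 = 4.0
    # One parameter at a complex place.
    print(archimedean_conductor([2j], True))       # (1 + 2)^2 = 9.0
```

For the real character $|x|^{it}$ the single parameter is $\mu = it$, and the output $1 + |t|$ agrees with the remark above.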

2.12.3. Cusp forms. If $\pi$ is a cuspidal representation of $GL_2(\mathbb{A}_F)$ or $PGL_2(\mathbb{A}_F)$, it 
will be convenient to denote by $\pi_\infty$ the archimedean representation (of $GL_2(F_\infty)$ 
or $PGL_2(F_\infty)$) that corresponds to it. 

By the dual $\widehat{GL_2(F_\infty)}$ or $\widehat{PGL_2(F_\infty)}$, we shall mean the space of irreducible, 
admissible representations. We say a subset of this dual is bounded if the 
corresponding set of Langlands parameters is bounded. We may define, in an evident 
way, the conductor $\mathrm{Cond}(\pi_\infty)$ for $\pi_\infty \in \widehat{GL_2(F_\infty)}$; with this definition, a subset is 
bounded exactly when Cond takes bounded values on it. 

In a similar fashion, we define the notion of a bounded subset of $\widehat{GL_2(F_v)}$ or 
$\widehat{PGL_2(F_v)}$ for any place $v$, where, again, $\widehat{GL_2(F_v)}$ denotes the set of irreducible, 
admissible representations. 

3. Unipotent periods. 

In this section, we will make systematic use of the (nonadelic) Sobolev norms 
$S_{\infty,d}$ on homogeneous spaces $\Gamma\backslash G$. In rough terms, $S_{\infty,d}$ controls the $L^\infty$ norm of 
the first $d$ derivatives. See Sec. 2.9.2. Note in particular that $S_{\infty,0}$ is just the $L^\infty$ 
norm. 

3.1. Equidistribution of sparse subsets of horocycles. Let $\Gamma \subset SL_2(\mathbb{R})$ be a 
cocompact lattice. For this section alone, we will use mildly different notation to 
that of Sec. 2.5, to accommodate the fact we deal with $SL_2$ and not with $PGL_2$. 
For $x \in \mathbb{R}$ put 

(3.1) $n(x) = \begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix}, \quad a(x) = \begin{pmatrix} x^{1/2} & 0 \\ 0 & x^{-1/2} \end{pmatrix}, \quad \bar{n}(x) = \begin{pmatrix} 1 & 0 \\ x & 1 \end{pmatrix}$. 

We denote by $C(\Gamma\backslash SL_2(\mathbb{R}))$ (resp. $C^\infty(\Gamma\backslash SL_2(\mathbb{R}))$) the space of continuous 
(resp. smooth) functions on the compact real manifold $\Gamma\backslash SL_2(\mathbb{R})$. We denote by 
$dg$ the measure on $SL_2(\mathbb{R})$ that descends to a probability measure on the quotient 
$\Gamma\backslash SL_2(\mathbb{R})$. Finally, we denote by $\mathbb{H}$ the usual upper half plane $\{z : \Im(z) > 0\}$ with 
the standard action of $SL_2(\mathbb{R})$. 

Theorem 3.1. There exists $\gamma_{\max} > 0$, depending on $\Gamma$, such that $\{x_0 n(j^{1+\gamma}) : j \in 
\mathbb{N}\}$ is equidistributed, for any $x_0 \in \Gamma\backslash SL_2(\mathbb{R})$ and any $0 \leq \gamma < \gamma_{\max}$. In other 
words, for any $f \in C(\Gamma\backslash SL_2(\mathbb{R}))$: 

$\lim_{N \to \infty} \frac{1}{N} \sum_{j=1}^{N} f(x_0 n(j^{1+\gamma})) = \int_{\Gamma\backslash SL_2(\mathbb{R})} f(g) \, dg$. 

If $\lambda_1$ is the smallest nonzero eigenvalue of the Laplacian on $\Gamma\backslash\mathbb{H}$, put $\alpha = 0$ if 
$\lambda_1 \geq 1/4$, and $\alpha = \sqrt{1/4 - \lambda_1}$ else. Then we can take $\gamma_{\max} = \frac{(1-2\alpha)^2}{16(3-2\alpha)}$. 

This result represents (extremely modest) progress towards a conjecture of N. Shah, 
which asserts that the statement should remain valid for any $\gamma > 0$. The method is 
not restricted to sequences of the specific type in Thm. 3.1, and we have also not 
optimized the maximal value for $\gamma_{\max}$. Nevertheless the method is fundamentally 
limited. As it presently stands, it does not seem capable of achieving even $\gamma = 1$. 
See also Remark 3.1. 

The dependence of $\gamma_{\max}$ on $\Gamma$ can likely be removed, but this seems to require 
using further input (cf. last paragraph of Section 1.3.4). 
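
For orientation, the size of the admissible exponent is easy to tabulate. The sketch below assumes the formula $\gamma_{\max} = (1-2\alpha)^2/(16(3-2\alpha))$ with $\alpha$ determined by the spectral gap as in the statement of Thm. 3.1; the function names are ours and the sample eigenvalues are illustrative, not taken from any particular lattice.

```python
import math


def alpha_from_gap(lambda1: float) -> float:
    """alpha = 0 if lambda1 >= 1/4, else sqrt(1/4 - lambda1)."""
    return 0.0 if lambda1 >= 0.25 else math.sqrt(0.25 - lambda1)


def gamma_max(lambda1: float) -> float:
    """Assumed formula gamma_max = (1 - 2a)^2 / (16 (3 - 2a))."""
    a = alpha_from_gap(lambda1)
    return (1.0 - 2.0 * a) ** 2 / (16.0 * (3.0 - 2.0 * a))


if __name__ == "__main__":
    for lam in (0.25, 0.21, 0.10):
        print(lam, gamma_max(lam))
```

Even with no exceptional spectrum ($\alpha = 0$) the value is $1/48$, which illustrates how far the method is from $\gamma = 1$.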

The proof follows the line of Sec. 1.3.1, with $G_1 = SL_2(\mathbb{R})$, $G_2 = \{n(x) : x \in \mathbb{R}\}$. 
The $Y_i$ are not quite closed $G_2$-orbits, but rather long pieces of general $G_2$-orbits. 
The basis $\{\psi_{i,j}\}$ for $Y_i$ will correspond to additive characters of $G_2 \cong \mathbb{R}$. 

Let $f \in C^\infty(\Gamma\backslash SL_2(\mathbb{R}))$, and let $\alpha$ be as in the statement of Thm. 3.1. Let 
$T \geq 1$. Let $\psi$ be a fixed nontrivial character of the additive group of $\mathbb{R}$. Let $g$ be 
a fixed smooth function of compact support on $\mathbb{R}$ satisfying $\int g(x) \, dx = 1$. We 
denote by $\langle \cdot, \cdot \rangle_{L^2(\Gamma\backslash G)}$ the inner product in the Hilbert space $L^2(\Gamma\backslash G)$. 
We set: 

(3.2) $\nu_T(f) = \frac{1}{T} \int_0^T f(x_0 n(t)) \, dt, \qquad \mu_{T,\psi}(f) = \frac{1}{T} \int_0^T \psi(t) f(x_0 n(t)) \, dt$. 

Remark first that the measures $\nu_T$ are equidistributed as $T \to \infty$, in the following 
quantitative sense: for $f \in C^\infty(\Gamma\backslash SL_2(\mathbb{R}))$, 

(3.3) $\left| \nu_T(f) - \int_{\Gamma\backslash SL_2(\mathbb{R})} f(g) \, dg \right| \ll T^{-\kappa_1} S_{\infty,1}(f)$, 

for any $\kappa_1 < \frac{1-2\alpha}{4}$. This is proven in Lem. 9.4, without taking any pains to 
optimize the exponent. (We prove it to keep the paper self-contained. However, we 
emphasize that neither result nor proof is new; see e.g. [2]. A precise analysis 
of the equidistribution of long horocycles is carried out in [11].) 

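Lemma 3.1. Suppose $\int_{\Gamma\backslash SL_2(\mathbb{R})} f(g) \, dg = 0$. Then: 

(3.4) $|\mu_{T,\psi}(f)| \ll T^{-b} S_{\infty,1}(f)$, 

whenever $b < \frac{(1-2\alpha)^2}{8(3-2\alpha)}$, and the implicit constant is independent of $\psi$. 

Remark that if $\psi$ is wildly oscillatory, cancellation in $\mu_{T,\psi}$ can be proved directly 
by integration by parts; on the other hand, if $\psi$ is almost constant, the cancellation 
in $\mu_{T,\psi}$ arises from the equidistribution of the horocycle $x_0 G_2$. It is therefore the 
intermediate case in which (3.4) is of interest. 

Proof. Let $H \geq 1$, and let $\sigma_H$ be the measure on $N(\mathbb{R})$ defined by $\sigma_H(g) = 
\frac{1}{H} \int_0^H \psi(x) g(n(x)) \, dx$, for $g$ a function on $N(\mathbb{R})$. 

For $f \in C^\infty(\Gamma\backslash G)$, we denote by $f \star \sigma_H$ the right convolution of $f$ by $\sigma_H$. Then 
it is easy to verify that $|\mu_{T,\psi}(f) - \mu_{T,\psi}(f \star \sigma_H)| \ll \frac{H}{T} S_{\infty,0}(f)$. On the other hand, 
by Cauchy-Schwarz, $|\mu_{T,\psi}(f \star \sigma_H)|^2 \leq \nu_T(|f \star \sigma_H|^2)$. Thus, expanding $|f \star \sigma_H|^2$ and 
applying (3.3), we conclude 

(3.5) $|\mu_{T,\psi}(f)| \ll \frac{H}{T} S_{\infty,0}(f) + \left( \frac{1}{H^2} \int_{(h_1,h_2) \in [0,H]^2} \left| \nu_T\big(n(h_1)f \cdot \overline{n(h_2)f}\big) \right| dh_1 \, dh_2 \right)^{1/2}$ 

$\ll \frac{H}{T} S_{\infty,0}(f) + \left( \frac{1}{H^2} \int_{(h_1,h_2) \in [0,H]^2} \left| \langle n(h_1 - h_2)f, f \rangle_{L^2(\Gamma\backslash G)} \right| dh_1 \, dh_2 \right)^{1/2} + T^{-\kappa_1/2} \left( \sup_{(h_1,h_2) \in [0,H]^2} S_{\infty,1}\big(n(h_1)f \cdot \overline{n(h_2)f}\big) \right)^{1/2}$. 

Utilising bounds towards matrix coefficients - see Sec. 9.1.2, esp. (9.6) - and basic 
properties of Sobolev norms (see^7 Lem. 2.2), we note: 

(3.6) $\langle n(h)f, f \rangle \ll_\epsilon (1 + |h|)^{2\alpha - 1 + \epsilon} S_{\infty,1}(f)^2, \qquad \left| S_{\infty,1}\big(n(h_1)f \cdot \overline{n(h_2)f}\big) \right| \ll (1 + |h_1| + |h_2|)^2 S_{\infty,1}(f)^2$. 

Thus, $|\mu_{T,\psi}(f)| \ll_\epsilon \left( \frac{H}{T} + H^{\alpha - 1/2 + \epsilon} + T^{-\kappa_1/2} H \right) S_{\infty,1}(f)$. Choose $H$ so that $H^{\alpha - 1/2} = 
H T^{-\kappa_1/2}$ to obtain the claimed result. □ 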
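
The final balancing step is pure exponent bookkeeping, and can be checked exactly. The sketch below assumes the three-term bound $H/T + H^{\alpha-1/2} + T^{-\kappa_1/2}H$ from the end of the proof (with the $\epsilon$ dropped) and writes $H = T^{h}$; the function name and the use of exact rationals are our own choices.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def balance_lemma_3_1(alpha: Fraction, kappa1: Fraction):
    """Return (h, b): H = T^h balances the last two terms, giving decay T^-b."""
    h = kappa1 / (3 - 2 * alpha)
    mixing = h * (HALF - alpha)        # H^(alpha - 1/2)   = T^(-mixing)
    equidist = kappa1 / 2 - h          # T^(-kappa1/2) * H = T^(-equidist)
    boundary = 1 - h                   # H / T             = T^(-boundary)
    assert mixing == equidist          # the two main terms are balanced
    return h, min(mixing, equidist, boundary)


if __name__ == "__main__":
    # alpha = 0 and kappa1 at the endpoint (1 - 2*alpha)/4 of its allowed range.
    print(balance_lemma_3_1(Fraction(0), Fraction(1, 4)))
```

At the endpoint $\kappa_1 = (1-2\alpha)/4$ the resulting exponent is $(1-2\alpha)^2/(8(3-2\alpha))$, which is $1/24$ when $\alpha = 0$; the boundary term $H/T$ is never the bottleneck.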

^7 Lem. 2.2 would actually give the exponent $(1 + |h_1| + |h_2|)^4$ in the latter inequality, but it is 
easy to see directly the stronger result. 

Proof (of Thm. 3.1). Given Lem. 3.1, the Theorem follows quite readily by 
Fourier-expanding the measure on $\mathbb{R}$ that is a sum of point masses at $j^{1+\gamma}$, for $j \in 
\mathbb{N}$. The argument that follows formalizes a minor variant of this argument (we 
first consider instead a sum of point masses along arithmetic progressions which 
approximate $\{j^{1+\gamma} : j \in \mathbb{N}\}$). 

Let $x_0 \in \Gamma\backslash SL_2(\mathbb{R})$, $f \in C^\infty(\Gamma\backslash SL_2(\mathbb{R}))$. We first claim that, if $b$ is as in the 
previous Lemma, $f$ is so that $\int_{\Gamma\backslash SL_2(\mathbb{R})} f(g) \, dg = 0$, and $K \geq 1$, then: 

(3.7) $\frac{\sum_{0 \leq j < K^{1/b - 1}} f(x_0 n(Kj))}{K^{1/b - 1}} \to 0$ 

as $K \to \infty$. In other words, $K^{1/b-1}$ points, distributed along a horocycle with 
spacing $K$, become equidistributed. 

This follows from Lem. 3.1: put $g_\delta(x) = \max(\delta^{-2}(\delta - |x|), 0)$, a function on $\mathbb{R}$. 
For $\lambda \in \mathbb{R}$, write $a_\lambda = K^{-1} \int_{\mathbb{R}} \exp(-2\pi i K^{-1} \lambda t) g_\delta(t) \, dt$. Then $\sum_{j \in \mathbb{Z}} g_\delta(t + Kj) = 
\sum_{k \in \mathbb{Z}} \exp(2\pi i K^{-1} k t) a_k$. Moreover, a simple computation shows that $\sum_{k \in \mathbb{Z}} |a_k| \ll \delta^{-1}$. 

Choose $\epsilon > 0$ so that $b + \epsilon$ still satisfies the inequality of Lem. 3.1. By Lem. 3.1, 

(3.8) $\left| \int_{t=0}^{T} dt \, \Big( \sum_{j} g_\delta(t + Kj) \Big) f(x_0 n(t)) \right| \ll_f T^{1-b-\epsilon} \delta^{-1}$. 

$g_\delta$ has integral 1 and is supported in a $\delta$-neighbourhood of 0; in particular, the 
left-hand side of (3.8) differs from $\sum_{j \in \mathbb{Z}, \, 0 \leq Kj < T} f(x_0 n(Kj))$ by an error that is 
$\ll_f (1 + T K^{-1} \delta)$. Thus $\left| \sum_{j \in \mathbb{Z}, \, 0 \leq Kj < T} f(x_0 n(Kj)) \right| \ll_f (1 + T K^{-1} \delta + T^{1-b-\epsilon} \delta^{-1})$, 
from which (3.7) readily follows. 

We now deduce the Theorem from (3.7). Let $T_0 \in \mathbb{N}$ be large. Then, for $t$ small, 
we have $(T_0 + t)^{1+\gamma} = T_0^{1+\gamma} + (1 + \gamma) T_0^{\gamma} t + O(t^2 T_0^{\gamma - 1})$. In particular, $(T_0 + t)^{1+\gamma}$ 
is well-approximated by the linear function $T_0^{1+\gamma} + (1 + \gamma) T_0^{\gamma} t$ in the range where 
$t^2 T_0^{\gamma - 1}$ is small, i.e. $t = o(T_0^{(1-\gamma)/2})$. 

The claim of Thm. 3.1 follows from (3.7) as long as $\frac{1-\gamma}{2\gamma} > 1/b - 1$; in particular, 
any $\gamma < b/2$ will do. □ 
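
The linearization used in the last step can be checked numerically. The sketch below compares $(T_0+t)^{1+\gamma}$ with its linear approximation and with the Taylor remainder bound $\tfrac{1}{2}\gamma(1+\gamma)\,t^2\,T_0^{\gamma-1}$, which is the explicit form of the $O(t^2 T_0^{\gamma-1})$ term for $0 < \gamma < 1$ and $t \geq 0$. The sample values of $T_0$, $t$ and $\gamma$ are arbitrary choices of ours.

```python
def linearization_error(T0: float, t: float, gamma: float) -> float:
    """(T0 + t)^(1+gamma) minus its first-order Taylor polynomial at T0."""
    exact = (T0 + t) ** (1.0 + gamma)
    linear = T0 ** (1.0 + gamma) + (1.0 + gamma) * T0 ** gamma * t
    return exact - linear


def taylor_bound(T0: float, t: float, gamma: float) -> float:
    """Second-order remainder bound, valid for 0 < gamma < 1 and t >= 0."""
    return 0.5 * gamma * (1.0 + gamma) * t * t * T0 ** (gamma - 1.0)


if __name__ == "__main__":
    T0, gamma = 1.0e6, 0.02
    for t in (10.0, 100.0, 1000.0):
        print(t, linearization_error(T0, t, gamma), taylor_bound(T0, t, gamma))
```

The error stays below 1 as long as $t$ is small compared with $T_0^{(1-\gamma)/2}$, which is the range in which the points $j^{1+\gamma}$ near $T_0^{1+\gamma}$ behave like an arithmetic progression of spacing $(1+\gamma)T_0^{\gamma}$.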



Remark 3.1. The applicability of this method is not, of course, restricted to 
sequences of the form $t^{1+\gamma}$, $t \in \mathbb{N}$. In certain (very artificial) settings, one can 
prove some effective instances of Ratner's theorem in non-horospherical cases by 
the same technique. The question of proving such results, even in very special 
cases, was raised by Margulis in his talk in the workshop "Emerging applications 
of measure rigidity." 

For instance, let $r \geq 1$ be an integer, and let $\Gamma$ be a cocompact lattice in 
$SL_2(\mathbb{R})^r$. For concreteness, let us suppose that $\Gamma$ is a congruence lattice. Let 
$n(x_1, \ldots, x_r) = (n(x_1), n(x_2), \ldots, n(x_r)) \in SL_2(\mathbb{R})^r$. Then it is well-known that 
one can give an effective version of the equidistribution of $n(\mathbb{R}^r)$, since $n(\mathbb{R}^r)$ is 
horospherical. Using the method described above, one can extend this slightly: 
let $V \subset \mathbb{R}^r$ be a linear subspace. One may show that $n(V)$-orbits are effectively 
equidistributed if $\dim(V)/r$ is sufficiently close to 1. However, the small codimension 
condition is crucial for this method and it does not seem that it would extend 
beyond this case. 



3.2. Fourier coefficients of automorphic forms. In the notations of the 
introduction (cf. Sec. 1.3.1), if we take for the $Y_i$ the closed orbits of a unipotent group, 
the resulting periods are so-called "Fourier coefficients of automorphic forms." We 
shall give a general nontrivial bound in that context. (The word "nontrivial" must 
be interpreted with care; see the discussion at the end of this section.) Our methods 
are restricted to the case of horospherical unipotent subgroups. 

We have made no effort to optimize the exponents of the results, nor even to 
state a result of maximal generality. In fact, one can considerably increase the 
scope of Thm. 3.2: since we deal in the present section only with closed orbits 
of horospherical subgroups, one can profitably apply spectral theory. We do not 
carry this out, instead using [18] to give equidistribution statements in a fairly soft 
fashion. 

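Let $G$ be a connected semisimple real Lie group, $\Gamma \subset G$ a lattice, $K \subset G$ 
the maximal compact subgroup, $\mathfrak{g}$ the Lie algebra of $G$, and $H \in \mathfrak{g}$ a semisimple 
element. Fix a norm $\| \cdot \|$ on the real vector space $\mathfrak{g}$. Let $\exp : \mathfrak{g} \to G$ be the 
exponential map. Fix a Haar measure on $G$ so that $\Gamma\backslash G$ has volume 1. Let $\mathfrak{u}$ be 
the sum of all negative root spaces for $H$ and let $U = \exp(\mathfrak{u}) \subset G$. Let $x_0 \in \Gamma\backslash G$ 
be so that $x_0 U$ is compact. Let $x_t = x_0 \exp(tH)$, and let $\Delta_t$ be the stabilizer of $x_t$ 
in $U$. We denote by $\langle \cdot, \cdot \rangle_{L^2(\Gamma\backslash G)}$ the inner product in the Hilbert space $L^2(\Gamma\backslash G)$. 
We shall analyze periods of a fixed function along $x_t U$ as $t$ varies. The proofs 
follow Sec. 1.3.1, with $G_1 = G$, $G_2 = U$, $Y_i = x_t U$, and $\psi_{i,j}$ corresponding to 
characters of $U$. 

Let $T \geq 0$ and let $\psi$ be any character of $U$ trivial on $\Delta_T$ ($:= \exp(-TH) \Delta_0 \exp(TH)$). 
We define $\nu_T$, $\mu_{T,\psi}$ in a closely analogous fashion to (3.2): 

(3.9) $\nu_T(f) = \frac{\int_{\Delta_T\backslash U} f(x_T u) \, du}{\mathrm{vol}(\Delta_T\backslash U)}, \qquad \mu_{T,\psi}(f) = \frac{\int_{\Delta_T\backslash U} \psi(u) f(x_T u) \, du}{\mathrm{vol}(\Delta_T\backslash U)}$. 

Let $f, g \in C^\infty(\Gamma\backslash G)$. Let $E \in \mathfrak{u}$ have unit length w.r.t. the fixed norm $\| \cdot \|$ on 
$\mathfrak{g}$. It is proven by Kleinbock-Margulis in [18] - see also Lem. 9.5 - 
that there are $\kappa_1, \kappa_2 > 0$ such that: 

(3.10) $\left| \langle \exp(sE) \cdot f, g \rangle_{L^2(\Gamma\backslash G)} - \int_{\Gamma\backslash G} f \int_{\Gamma\backslash G} \bar{g} \right| \ll (1 + |s|)^{-\kappa_1} S_{\infty,\dim(K)}(f) \, S_{\infty,\dim(K)}(g)$, 

(3.11) $\left| \nu_T(f) - \int_{\Gamma\backslash G} f \right| \ll e^{-\kappa_2 T} S_{\infty,\dim(K)}(f)$. 

(3.10) and (3.11) assert, respectively, quantitative mixing of the flow generated by 
$U$ on $\Gamma\backslash G$, and the equidistribution of the orbit $x_T U$ as $T \to \infty$. We may assume 
that $\kappa_1 < 1$ (since making $\kappa_1$ smaller does not change the truth of (3.10)). This 
will ease the notation in the proof of the Theorem. 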

Theorem 3.2. There exists $\kappa_3 > 0$ such that, for any $f \in C^\infty(\Gamma\backslash G)$ satisfying 
$\int_{\Gamma\backslash G} f = 0$, we have: 

(3.12) $|\mu_{T,\psi}(f)| \ll \exp(-T \kappa_3) \, S_{\infty,\dim(K)}(f)$ 

for all $T \geq 0$, and for all characters $\psi$ of $U$ trivial on $\Delta_T$. 

Indeed, if $o$ is the order of the polynomial map $\mathbb{R} \to \mathrm{End}(\mathfrak{g})$ defined by $s \mapsto 
\mathrm{Ad}(\exp(sE))$, then any $\kappa_3 < \frac{\kappa_1 \kappa_2}{2(2 o \dim(K) + \kappa_1)}$ is admissible, $\kappa_1, \kappa_2$ being as in (3.10) 
and (3.11). 

The relevance of this to "Fourier coefficients" in the classical sense may not be 
immediately clear; after the proof, we give the example of $SL_2(\mathbb{R})$ to illustrate. 

Also, observe that the estimate (3.12) is uniform in $\psi$. In fact, just as in Lem. 
3.1, the case when $\psi$ is constant amounts to (3.11), whereas the case where $\psi$ 
is highly oscillatory could be handled by integration by parts. It is, again, the 
intermediate case where (3.12) has content. 


Proof. We first remark that, other than being in a slightly more general setting, 
the proof is almost exactly the same as the proof of Lem. 3.1. 

The signed measure $\mu_{T,\psi}$ satisfies $\mu_{T,\psi}(u \cdot f) = \overline{\psi(u)} \, \mu_{T,\psi}(f)$, for $u \in U$. 

Take $E \in \mathfrak{u}$ of unit length w.r.t. the norm $\| \cdot \|$ on $\mathfrak{g}$. Fix $H \geq 1$. Let $\sigma$ be the 
measure on $U$ defined via the rule 

$\sigma(h) = \frac{1}{H} \int_{s=0}^{H} \psi(\exp(sE)) \, h(\exp(sE)) \, ds$, 

for $h$ any continuous compactly supported function on $U$. Then $\mu_{T,\psi}(f) = \mu_{T,\psi}(f \star 
\sigma)$; thus: 

(3.13) $|\mu_{T,\psi}(f)|^2 = |\mu_{T,\psi}(f \star \sigma)|^2 \leq \nu_T(|f \star \sigma|^2)$ 

$\ll \int_{\Gamma\backslash G} |f \star \sigma|^2 + e^{-\kappa_2 T} S_{\infty,\dim(K)}(|f \star \sigma|^2)$ 

$\ll \int_{\Gamma\backslash G} |f \star \sigma|^2 + e^{-\kappa_2 T} H^{2 o \dim(K)} S_{\infty,\dim(K)}(f)^2$, 

where we have applied Cauchy-Schwarz followed by (3.11), noting that by Lem. 2.2 
we have that $S_{\infty,\dim(K)}(|f \star \sigma|^2) \ll S_{\infty,\dim(K)}(f \star \sigma)^2 \ll H^{2 o \dim(K)} S_{\infty,\dim(K)}(f)^2$. 
Here $o$ is chosen as in the statement of the Theorem. 
By (3.10), we see that 

(3.14) $\int_{\Gamma\backslash G} |f \star \sigma|^2 \ll H^{-\kappa_1} S_{\infty,\dim(K)}(f)^2$. 

Indeed, this follows simply by expanding the leftmost expression. Thus 
$|\mu_{T,\psi}(f)|^2 \ll \left( H^{-\kappa_1} + H^{2 o \dim(K)} \exp(-\kappa_2 T) \right) S_{\infty,\dim(K)}(f)^2$. 
We choose $H$ so that $H^{2 o \dim(K) + \kappa_1} = \exp(\kappa_2 T)$ to conclude. □ 
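
The choice of $H$ at the end of the proof can be verified by exact arithmetic on exponents. The sketch below assumes the bound $|\mu_{T,\psi}(f)|^2 \ll H^{-\kappa_1} + H^{2o\dim(K)}e^{-\kappa_2 T}$ and writes $H = e^{cT}$; the function names and the sample parameter values are hypothetical and serve only to illustrate the bookkeeping.

```python
from fractions import Fraction


def balanced_rates(kappa1, kappa2, o, dim_K):
    """With H = exp(c*T), c = kappa2/(2*o*dim_K + kappa1), return both decay rates.

    Each returned value r means the corresponding term equals exp(-r*T).
    """
    c = kappa2 / (2 * o * dim_K + kappa1)
    mixing_rate = kappa1 * c                    # H^(-kappa1)
    equidist_rate = kappa2 - 2 * o * dim_K * c  # H^(2*o*dim_K) * exp(-kappa2*T)
    return mixing_rate, equidist_rate


def kappa3_supremum(kappa1, kappa2, o, dim_K):
    """Half of the common rate, since the bound above is for the square."""
    return kappa1 * kappa2 / (2 * (2 * o * dim_K + kappa1))


if __name__ == "__main__":
    k1, k2 = Fraction(1, 2), Fraction(1)
    print(balanced_rates(k1, k2, 2, 1))
    print(kappa3_supremum(k1, k2, 2, 1))
```

Both terms decay at the same rate $\kappa_1\kappa_2/(2o\dim(K)+\kappa_1)$, and taking the square root gives the admissible range for $\kappa_3$ stated in the Theorem.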

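Remark 3.2. We now explain, when we specialize to $G = SL_2(\mathbb{R})$, why this 
recovers Sarnak's result [30], which was the first improvement of the Hecke bound for 
nonarithmetic groups. Take $H = \begin{pmatrix} -1/2 & 0 \\ 0 & 1/2 \end{pmatrix} \in \mathfrak{sl}_2$, so that $U = \left\{ \begin{pmatrix} 1 & * \\ 0 & 1 \end{pmatrix} \right\}$. Let 
$\Gamma \subset G$ be a nonuniform lattice so that $\Gamma \cap U = \left\{ \begin{pmatrix} 1 & n \\ 0 & 1 \end{pmatrix}, n \in \mathbb{Z} \right\}$, and take 
$x_0 = \Gamma \cdot \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$. Let $f(x + iy) = \sum_{n \neq 0} a_n \sqrt{y} \, K_{i\nu}(2\pi |n| y) \, e^{2\pi i n x}$ be a Maass cusp 
form of eigenvalue $1/4 + \nu^2$ on $\Gamma\backslash\mathbb{H}$, where $\mathbb{H}$ denotes the upper half-plane; it lifts 
to a function on $\Gamma\backslash SL_2(\mathbb{R})$, viz. $g \mapsto f(g \cdot i)$. 

Then the Theorem implies (in concrete language) that there exists $\delta > 0$ such 
that, for any $y < 1$ and any $n \in \mathbb{Z}$, 

(3.15) $\left| \int_{0 \leq x \leq 1} f(x + iy) \, e(nx) \, dx \right| \ll y^{\delta}$. 

Taking $y = 1/n$ in (3.15), one easily deduces that the Fourier coefficients $a_n$ satisfy 
the "nontrivial" bound $|a_n| \ll n^{1/2 - \delta}$. 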

Remark 3.3. The bound of Thm. 3.2 is nontrivial in that it improves, as $t \to \infty$, on 
the trivial bound: 

$\left| \frac{\int_{\Delta_t\backslash U} f(x_t u) \psi(u) \, du}{\mathrm{vol}(\Delta_t\backslash U)} \right| \ll_f 1$, 

which follows from Cauchy-Schwarz. 

However, if there is another interpretation for the Fourier coefficients, it is not 
always the case that Thm. 3.2 improves on "trivial" bounds arising from that 
interpretation. 
For instance, Fourier coefficients of cusp forms on $GL_n$ admit a spectral 
interpretation, that is to say, they are connected to the eigenvalues of Hecke operators. 
In that case, Thm. 3.2 does not give anything even approaching the bounds of 
Jacquet, Piatetski-Shapiro and Shalika. Another example is when $G = SL_2(\mathbb{R})$, and one takes 
for $f$ the Shimura lift of a cusp form of integral weight. In that case the (absolute 
values of the squares of) square-free Fourier coefficients of $f$ are given by special 
values of a twisted L-function; but the estimate above does not even recover the 
convexity bound (in fact, the method as indicated cannot recover this bound, even 
under optimal assumptions). 

It seems as though, in these cases, there is extra cancellation in the unipotent 
integrals for subtle arithmetic reasons. The crude methods indicated above do not 
detect this. 

Remark 3.4. We remark that, in the proof just given, the constant $\kappa_3$ depends on 
the spectral gap for $\Gamma\backslash G$. This dependence can very likely be removed in many 
cases, including the case of $G = SL_2(\mathbb{R})$, but we do not carry this out; again, cf. 
the last paragraph of Section 1.3.4. In the higher rank case, if $G$ has property (T), 
one has in any case a uniform spectral gap and this point becomes irrelevant. 

It seems worthwhile to remark that, whereas the proof above is clearly not 
unrelated to that of Sarnak, it does not require any information on the decay of triple 
products (in particular, the deep "exponential decay" results proved by Sarnak and 
Bernstein-Reznikov). 

We also remark that the proof indicated above, although it can be optimized 
in various ways, probably does not lead to as good an exponent as the work of 
Good, and the later refinement of Sarnak's result due to Bernstein-Reznikov. Its 
advantage lies, rather, in its robustness and general applicability. 



4. Semisimple periods: triple products in the level aspect. 

In this section, we will give bounds for the triple product period on $PGL_2$ over 
a number field $F$. We will use the notation of Sec. 2; in particular, $\mathbb{A}_F$ is the adele 
ring of $F$ and $X = PGL_2(F)\backslash PGL_2(\mathbb{A}_F)$. 

In Sec. 4.1, we will give a special case of the triple product bound (Prop. 4.1) 
which does not require Sobolev norms to state. For some applications we will 
require a slight generalization, which will require the Sobolev norms of Sec. 2.9.3. 
This will be carried out in Sec. 4.2 (see Prop. 4.2). 

4.1. Period bound for triple products. We now give a period bound for triple 
products on $PGL_2$. Although it is unfortunately somewhat disguised in the adelic 
language, the situation and method corresponds to that of Sec. 1.3.1, with $G_1 = 
PGL_2(F_S) \times PGL_2(F_S)$, $G_2 = PGL_2(F_S)$ embedded diagonally. Here $S$ is a set of 
places of $F$ containing all infinite places, and $F_S = \prod_{v \in S} F_v$. 

For $1 \leq p \leq \infty$ we will write $L^p$ for $L^p(X)$. Thus, e.g., $\|f_1\|_{L^4}$ denotes 
$\left( \int_X |f_1|^4 \right)^{1/4}$. 

Proposition 4.1. ("Subconvexity for the triple product period.") Let $\pi$ be an 
automorphic cuspidal representation of $PGL_2(\mathbb{A}_F)$ with prime finite conductor $\mathfrak{p}$. 
Let $f_1, f_2 \in C^\infty(X)$ be totally nondegenerate^8 and such that $f_1, f_2$ are 
$PGL_2(\mathfrak{o}_{F_{\mathfrak{p}}})$-invariant. Let $\varphi \in \pi$ and suppose^9 that there exists $b \in \mathbb{R}$ such that^10 

(4.1) $\prod_{\mathfrak{q} \in \mathrm{Supp}(\varphi) \cup \mathrm{Supp}(f_1) \cup \mathrm{Supp}(f_2)} N(\mathfrak{q}) \leq N(\mathfrak{p})^{b}$. 

Put $I(\varphi) = \int_X f_1(g) f_2(g a([\mathfrak{p}])) \varphi(g) \, dg$, where $a([\mathfrak{p}])$ is as in Sec. 2 and $dg$ is the 
$PGL_2(\mathbb{A}_F)$-invariant probability measure. Then 

(4.2) $|I(\varphi)| \ll_{b,\epsilon,F} \|f_1\|_{L^4} \, \|f_2\|_{L^4} \, \|\varphi\|_{L^2} \, N(\mathfrak{p})^{\epsilon - \frac{(1-2\alpha)(1-4\alpha)}{4(3-4\alpha)}}$. 

We refer to Prop. 4.1 as subconvexity for the triple product period, cf. first 
assertion of Theorem 5.1. We note that, with $\alpha = 3/26$ (Kim's bound) we have 
$\frac{(1-2\alpha)(1-4\alpha)}{4(3-4\alpha)} > 1/26$. As we will see, Prop. 4.1 is a very strong result that implies 
many subconvexity results on PGL(2). 

First let us explain the content of Prop. 4.1 in a classical setting, and how it can 
be regarded as the type of period bound discussed in the introduction. Suppose 
$F = \mathbb{Q}$; let $p \geq 1$ and let 

$\Gamma_0(p) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in GL_2(\mathbb{Z}) : p \mid c \right\}$, 

and put $Y_0(p) = \Gamma_0(p)\backslash PGL_2(\mathbb{R})$. Then there is an embedding $Y_0(p) \to Y_0(1) \times Y_0(1)$ 
which corresponds to the graph of the $p$th Hecke correspondence on $Y_0(1)$; the image 
is a certain closed orbit of the diagonal $PGL_2(\mathbb{R})$. Let $f_1, f_2$ be fixed functions 
on $Y_0(1)$ and $\varphi$ a Maass form on $Y_0(p)$. Then the function $f_1 \times f_2 : (x_1, x_2) \mapsto 
f_1(x_1) f_2(x_2)$ is a function on $Y_0(1) \times Y_0(1)$, and we can construct its restriction 
$f_1 \times f_2|_{Y_0(p)}$ by means of the embedding indicated above. Then, translating from 
adelic to classical, one finds that (4.2) furnishes precisely an estimate for $\int_{Y_0(p)} (f_1 \times 
f_2)|_{Y_0(p)} \cdot \varphi$. When we vary $p$, $\varphi$ (and hold $f_1, f_2$ fixed), such an estimate falls precisely 
into the pattern described in the introduction: we are computing the periods of the 
fixed function $f_1 \times f_2$ along the varying sequence of sets $Y_0(p)$. The fact that the $Y_0(p)$ 
become equidistributed in $Y_0(1) \times Y_0(1)$ is precisely equivalent to the equidistribution 
of $p$-Hecke orbits on $Y_0(1)$. Moreover, the key property of $\varphi$ that is used is the fact 
that $\varphi$ is an eigenfunction of many Hecke operators; this is used to construct the 
measure $\sigma$, in the notation of Sec. 1.3. 
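
For readers who prefer to see the Hecke correspondence concretely, the following sketch lists the standard representatives for the $p$th Hecke operator at level one when $p$ is prime. This is a textbook fact rather than something taken from the text above, and the function names are ours. There are $p + 1$ representatives, matching the index of $\Gamma_0(p)$ in $GL_2(\mathbb{Z})$, so each point of $Y_0(1)$ has a $p$-Hecke orbit of size $p + 1$.

```python
def hecke_representatives(p: int):
    """Standard integer matrices of determinant p, for p prime.

    Returns the p matrices ((1, j), (0, p)) for 0 <= j < p,
    together with ((p, 0), (0, 1)).
    """
    reps = [((1, j), (0, p)) for j in range(p)]
    reps.append(((p, 0), (0, 1)))
    return reps


def determinant(m) -> int:
    (a, b), (c, d) = m
    return a * d - b * c


if __name__ == "__main__":
    for p in (2, 3, 5, 7):
        reps = hecke_representatives(p)
        print(p, len(reps), {determinant(m) for m in reps})
```

As $p$ grows these orbits spread out over $Y_0(1)$, which is the equidistribution statement used in the discussion above.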

^8 See Sec. 2.7; equivalent to "orthogonal to locally constant functions" in this case. 
^9 Recall that a finite place $v$ belongs to the support of $f \in C^\infty(X)$ exactly when $PGL_2(\mathfrak{o}_v)$ 
does not fix $f$. 
^10 This assumption (4.1) is purely technical and the reader may safely assume that $\varphi$ is 
spherical at all places away from $\mathfrak{p}$ and $f_1, f_2$ are everywhere spherical without losing the gist of the 
argument. It is not used in the present document, but will probably be of use in establishing 
polynomial dependence of subconvex bounds. (4.1) ensures, among other things, that there are 
many places where all of $f_1, f_2, \varphi$ are unramified, so that we can use the Hecke operators at those 
places. 

Prior to beginning the proof, we make some comments about applications and 
generalizations; for details, we refer to Sec. 5. The implicit constant of (4.2) is 
independent of $f_1, f_2$. Taking $f_1, f_2$ to be a pair of cusp forms, Prop. 4.1 implies 
(conditional on some computations of p-adic integrals that we state as Hypothesis 
11.1) subconvexity for certain triple product L-functions. Similarly, taking 
$f_1, f_2$ to be a cusp form and an Eisenstein series, resp. a pair of Eisenstein series, 
Prop. 4.1 implies subconvexity for Rankin-Selberg convolutions and standard 
L-functions. 

The latter applications are rather delicate because Eisenstein series are not in $L^4$. 
To get around this we will eventually replace the Eisenstein series by an appropriate 
wave-packet (cf. proof of Thm. 5.1). 

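Proof. Clearly we may assume that $\|\varphi\|_{L^2} = \|f_1\|_{L^4} = \|f_2\|_{L^4} = 1$. It follows that 
$\|f_1\|_{L^2} \leq 1$ and $\|f_2\|_{L^2} \leq 1$. 

We shall moreover assume, for simplicity, that $\varphi$ is spherical at all finite places 
$v \neq \mathfrak{p}$. The reader may verify that the proof carries through to the more general 
situation of Prop. 4.1 without modification. 

Put $q = N(\mathfrak{p})$. Let $\sigma$ be a (signed real) measure on $PGL_2(\mathbb{A}_F)$ such that $\varphi \star \sigma = 
\lambda \varphi$, for some $\lambda \in \mathbb{C}$. We shall assume that $\mathrm{supp}(\sigma)$ commutes with $PGL_2(F_{\mathfrak{p}})$; we 
will choose $\sigma$ later. Set further $\Psi(x) = f_1(x) f_2(x a([\mathfrak{p}])) \in C^\infty(X)$. Then 

(4.3) $|\lambda \cdot I(\varphi)| = \left| \int_X \Psi(x) \cdot (\varphi \star \sigma)(x) \, dx \right| = \left| \int_X (\Psi \star \sigma)(x) \cdot \varphi(x) \, dx \right| \leq \left( \int_X |\Psi \star \sigma|^2 \, dx \right)^{1/2}$ 

$= \left( \int_X \int_{(g,g') \in PGL_2(\mathbb{A}_F)^2} (g \cdot \Psi) \, \overline{(g' \cdot \Psi)} \, d\sigma(g) \, d\sigma(g') \, dx \right)^{1/2}$ 

$= \left( \int_X \int_{(g,g') \in PGL_2(\mathbb{A}_F)^2} f_1(xg) f_2(x a([\mathfrak{p}]) g) \, \overline{f_1(xg') f_2(x a([\mathfrak{p}]) g')} \, d\sigma(g) \, d\sigma(g') \, dx \right)^{1/2}$ 

$= \left( \int_X \int_{(g,g') \in PGL_2(\mathbb{A}_F)^2} f_1(xg) f_2(x g a([\mathfrak{p}])) \, \overline{f_1(xg') f_2(x g' a([\mathfrak{p}]))} \, d\sigma(g) \, d\sigma(g') \, dx \right)^{1/2}$. 

In the last step, we have used the fact that $PGL_2(F_{\mathfrak{p}})$, and thus $a([\mathfrak{p}])$, commutes 
with $\mathrm{supp}(\sigma)$. 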

For any two functions $h_1, h_2$ on $X$, both right invariant by $PGL_2(\mathfrak{o}_{F_{\mathfrak{p}}})$, the 
assumed bound on Ramanujan (Def. 2.1) implies: 

(4.4) $\left| \int_X h_1(x) h_2(x a([\mathfrak{p}])) \, dx - \sum_{\chi} \chi(\mathfrak{p}) \int_X h_1(x) \chi(x) \, dx \int_X h_2(x) \chi(x) \, dx \right| \leq 2 q^{\alpha - 1/2} \|h_1\|_{L^2} \|h_2\|_{L^2}$. 

Here $\chi$ ranges over all characters of $\mathbb{A}_F^\times/F^\times$ such that $\chi^2 = 1$, and $\chi(x)$ denotes 
the function $g \mapsto \chi(\det(g))$ on $PGL_2(F)\backslash PGL_2(\mathbb{A}_F)$. Indeed, to see (4.4), we note 
that the quantity inside the absolute value on the left hand side of (4.4) equals 
$\langle h_1 - \mathscr{P} h_1, a([\mathfrak{p}]) \cdot (h_2 - \mathscr{P} h_2) \rangle_{L^2}$, where $\mathscr{P}$ is as in Sec. 2 (in this case, the 
orthogonal projection onto the locally constant functions). But the $L^2$ orthogonal 
projection $\mathrm{Id} - \mathscr{P}$ kills all one-dimensional $PGL_2(F_{\mathfrak{p}})$ representations, from which 
the result follows easily. 

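The functions $h_j(x) = f_j(xg) \overline{f_j(xg')}$ ($j = 1, 2$) are $PGL_2(\mathfrak{o}_{F_{\mathfrak{p}}})$-invariant for 
$g, g' \in \mathrm{supp}(\sigma)$, since $\mathrm{supp}(\sigma)$ commutes with $PGL_2(F_{\mathfrak{p}})$ and $\mathfrak{p} \notin \mathrm{Supp}(f_1) \cup 
\mathrm{Supp}(f_2)$. Moreover, $\|h_j\|_{L^2} \leq \|f_j\|_{L^4}^2$. Apply (4.4) to these $h_j$, and substitute 
into (4.3). It results: 

(4.5) $|\lambda \cdot I(\varphi)|^2 \ll q^{\alpha - 1/2} \|\sigma\|^2 + \sum_{\chi^2 = 1} \int \left| \langle g^{-1} g' \cdot f_1, f_1 \otimes \chi \rangle \langle g^{-1} g' \cdot f_2, f_2 \otimes \chi \rangle \right| d|\sigma|(g) \, d|\sigma|(g')$ 

where $|\sigma|$ is the total variation measure associated to $\sigma$, $\|\sigma\| = |\sigma|(PGL_2(\mathbb{A}_F))$ is the total 
variation of $\sigma$, $f_1 \otimes \chi$ is the function $x \mapsto f_1(x) \chi(\det(x))$, and brackets $\langle \cdot, \cdot \rangle$ denote 
inner product in the Hilbert space $L^2(X)$; we will suppress the reference to $L^2(X)$ 
here and in the rest of the argument. 

Put $\sigma^{(2)} = |\sigma| \star |\sigma|$. We may rewrite the previous result as: 

(4.6) $|\lambda|^2 |I(\varphi)|^2 \ll q^{\alpha - 1/2} \|\sigma\|^2 + \int_g \sum_{\chi} \left| \langle g \cdot f_1, f_1 \otimes \chi \rangle \right| \cdot \left| \langle g \cdot f_2, f_2 \otimes \chi \rangle \right| d\sigma^{(2)}(g)$. 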

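We shall take $\sigma$ in (4.6) to be a linear combination of Hecke operators. We 
follow the notations introduced in Sec. 2.8. For $\mathfrak{n}$ prime to $\mathrm{Supp}(\varphi)$, we denote by $\lambda(\mathfrak{n})$ 
the $\mathfrak{n}$th Hecke eigenvalue of $\varphi$, i.e. $\varphi \star \mu_{\mathfrak{n}} = \lambda(\mathfrak{n}) \varphi$. With our normalizations, 
the Ramanujan conjecture amounts to $|\lambda(\mathfrak{l})| \leq 2$ for $\mathfrak{l}$ prime. 

Let $a_{\mathfrak{n}}$ be a sequence of complex numbers indexed by integral ideals of $\mathfrak{o}_F$. 
Assume moreover that $a_{\mathfrak{n}} = 0$ whenever $\mathfrak{n}$ is divisible by any place in $\mathrm{Supp}(\varphi) \cup 
\mathrm{Supp}(f_1) \cup \mathrm{Supp}(f_2)$. Let $\sigma$ be the measure on $PGL_2(\mathbb{A}_F)$ defined by $\sum_{\mathfrak{n}} a_{\mathfrak{n}} \mu_{\mathfrak{n}}$. 
Then $\sigma$ is symmetric under $g \mapsto g^{-1}$, and $|\sigma| = \sum_{\mathfrak{n}} |a_{\mathfrak{n}}| \mu_{\mathfrak{n}}$. Moreover, $\varphi \star \sigma = \lambda \varphi$, 
where $\lambda = \sum_{\mathfrak{n}} a_{\mathfrak{n}} \lambda(\mathfrak{n})$. From the assumed bound on Ramanujan (see Sec. 9.1, esp. 
equation (9.1))^11, an elementary computation shows: 

(4.7) $\int_{g \in PGL_2(\mathbb{A}_F)} \left| \langle g \cdot f_1, f_1 \otimes \chi \rangle \langle g \cdot f_2, f_2 \otimes \chi \rangle \right| d\mu_{\mathfrak{n}}(g) \ll_\epsilon N(\mathfrak{n})^{2\alpha - 1/2 + \epsilon}$. 

Moreover, for fixed $g \in \mathrm{Supp}(\mu_{\mathfrak{n}})$, the inner product $\langle g \cdot f_1, f_1 \otimes \chi \rangle$ is nonvanishing 
only if $\chi$ is unramified at all places not in $\mathrm{Supp}(f_1)$ and not dividing 
$\mathfrak{n}$. The number of such quadratic characters is $O_\epsilon(N(\mathfrak{n})^\epsilon q^\epsilon)$, where the implicit 
constant (as always) is allowed to depend on the base field $F$. Thus: 

(4.8) $\sum_{\chi^2 = 1} \int_g \left| \langle g \cdot f_1, f_1 \otimes \chi \rangle \langle g \cdot f_2, f_2 \otimes \chi \rangle \right| d\mu_{\mathfrak{n}}(g) \ll_\epsilon q^{\epsilon} N(\mathfrak{n})^{2\alpha - 1/2 + \epsilon}$. 

The total variation of $\sigma$ may be computed: 

(4.9) $\|\sigma\| \ll_\epsilon \sum_{\mathfrak{n}} N(\mathfrak{n})^{1/2 + \epsilon} |a_{\mathfrak{n}}|$. 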



^11 A small caution here is that the vectors $f_1 \otimes \chi$ and $f_2 \otimes \chi$ need not be invariant by $GL_2(\mathfrak{o}_{F_{\mathfrak{p}}})$, 
if $\mathfrak{p} | \mathfrak{n}$, because $\chi$ may be ramified. However, $GL_2(\mathfrak{o}_{F_{\mathfrak{p}}})$ always fixes the line spanned by either of 
these vectors, and the bound of (9.1) depends only on the dimension of the $GL_2(\mathfrak{o}_{F_{\mathfrak{p}}})$-span of the 
vectors in question. 






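Using (2.7) to compute $\sigma^{(2)}$, and combining (4.6), (4.7) and (4.9), we conclude: 

(4.10) $|I(\varphi)| \ll_\epsilon q^{\epsilon} \left( \frac{\left( \sum_{\mathfrak{n}} N(\mathfrak{n})^{1/2 + \epsilon} |a_{\mathfrak{n}}| \right)^2 q^{\alpha - 1/2} + \sum_{\mathfrak{n},\mathfrak{m}} \sum_{\mathfrak{d} | (\mathfrak{n},\mathfrak{m})} N\!\left( \frac{\mathfrak{n}\mathfrak{m}}{\mathfrak{d}^2} \right)^{2\alpha - 1/2 + \epsilon} |a_{\mathfrak{n}} a_{\mathfrak{m}}|}{\left| \sum_{\mathfrak{n}} a_{\mathfrak{n}} \lambda(\mathfrak{n}) \right|^2} \right)^{1/2}$. 

The choice of $a_{\mathfrak{n}}$ follows an idea of Iwaniec; we slightly modify the standard 
choice so that we do not need to appeal to Ramanujan on average.^12 Fix $K$ with 
$q^{1/1000} < K < q^{1000}$. Let $S$ be the set of prime ideals $\mathfrak{l}$ such that $N(\mathfrak{l}) \in [K, 2K]$ 
and $\mathfrak{l} \notin \mathrm{Supp}(f_1) \cup \mathrm{Supp}(f_2) \cup \mathrm{Supp}(\varphi)$. In view of the assumptions, $|S| \gg_\epsilon K^{1 - \epsilon}$. 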

For z G C we put sign(z) = z/\z\ for z ^ and sign(0) = 1. Put 



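(4.11) $a_{\mathfrak{n}} = \begin{cases} \mathrm{sign}(\lambda(\mathfrak{n})), & \mathfrak{n} \in S \\ \mathrm{sign}(\lambda(\mathfrak{l}^2)), & \mathfrak{n} = \mathfrak{l}^2, \ \mathfrak{l} \in S \\ 0, & \text{else.} \end{cases}$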

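Then $\left| \sum_{\mathfrak{n}} a_{\mathfrak{n}} \lambda(\mathfrak{n}) \right| \gg_{\epsilon,F} K^{1-\epsilon}$, $\sum_{\mathfrak{n}} N(\mathfrak{n})^{1/2+\epsilon} |a_{\mathfrak{n}}| \ll_{\epsilon} K^{2+\epsilon}$, and

(4.12) $\sum_{\mathfrak{n},\mathfrak{m}} \sum_{\mathfrak{d} | (\mathfrak{n},\mathfrak{m})} N\!\left( \frac{\mathfrak{n}\mathfrak{m}}{\mathfrak{d}^2} \right)^{2\alpha - 1/2} |a_{\mathfrak{n}} a_{\mathfrak{m}}| \ll K^{1+4\alpha}$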
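The lower bound on the amplifier sum can be illustrated numerically. The following is a minimal sketch, not part of the original argument: it assumes real Hecke eigenvalues and the standard Hecke relation $\lambda(\mathfrak{l})^2 = \lambda(\mathfrak{l}^2) + 1$ (an assumption about the normalization, consistent with Ramanujan reading $|\lambda(\mathfrak{l})| \le 2$). The function names are hypothetical. Under these assumptions each prime in $S$ contributes at least $1/2$, so the sum is at least $|S|/2$.

```python
def sign(z):
    """sign(z) = z/|z| for z != 0, and sign(0) = 1, as in the text."""
    return z / abs(z) if z != 0 else 1.0


def amplifier_sum(eigenvalues):
    """Return sum_n a_n * lambda(n) for weights chosen as in (4.11).

    `eigenvalues` lists lambda(l) for the primes l in S (assumed real).
    lambda(l^2) is computed from the ASSUMED Hecke relation
    lambda(l)^2 = lambda(l^2) + 1.
    """
    total = 0.0
    for lam in eigenvalues:
        lam_sq = lam * lam - 1.0          # eigenvalue at l^2 (assumption)
        total += sign(lam) * lam          # contribution of n = l
        total += sign(lam_sq) * lam_sq    # contribution of n = l^2
    return total


sample = [0.0, 1.0, 2.0, -0.3, 0.7]       # made-up eigenvalues, for illustration only
print(amplifier_sum(sample) >= len(sample) / 2)
```

The point of including both $\mathfrak{l}$ and $\mathfrak{l}^2$ is that $\lambda(\mathfrak{l})$ and $\lambda(\mathfrak{l}^2)$ cannot both be small, so no average form of Ramanujan is needed.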

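We deduce from (4.10) that

(4.13) $|I(\varphi)| \ll_{\epsilon,F} (qK)^{\epsilon} \left( \dfrac{K^{4} q^{\alpha - 1/2} + K^{1+4\alpha}}{K^{2}} \right)^{1/2}$

Taking $K = q^{\frac{1/2 - \alpha}{3 - 4\alpha}}$, we obtain $|I(\varphi)| \ll_{\epsilon,F} q^{\frac{(2\alpha - 1/2)(1/2 - \alpha)}{3 - 4\alpha} + \epsilon}$. □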
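The choice of $K$ at the end of the proof is an exponent-balancing step. The sketch below is an illustration under a stated assumption: it reads the two competing terms in $|I(\varphi)|^2$ as $K^{2} q^{\alpha - 1/2}$ and $K^{4\alpha - 1}$ (the two terms inside (4.13) after dividing by $K^2$). It checks with exact rational arithmetic that $K = q^{(1/2-\alpha)/(3-4\alpha)}$ equalizes them. The function name is hypothetical.

```python
from fractions import Fraction


def balance_prop_4_1(alpha):
    """Balance K^2 q^(alpha - 1/2) against K^(4 alpha - 1), with K = q^kappa.

    Returns (kappa, exponent), where |I(phi)| << q^(exponent + eps).
    """
    alpha = Fraction(alpha)
    kappa = (Fraction(1, 2) - alpha) / (3 - 4 * alpha)
    first = 2 * kappa + alpha - Fraction(1, 2)   # q-exponent of K^2 q^(alpha - 1/2)
    second = (4 * alpha - 1) * kappa             # q-exponent of K^(4 alpha - 1)
    assert first == second                       # the two terms are balanced
    return kappa, second / 2                     # square root of the balanced term


print(balance_prop_4_1(Fraction(3, 26)))   # exponents for the value alpha = 3/26
print(balance_prop_4_1(0))                 # exponents under Ramanujan (alpha = 0)
```

For $\alpha = 0$ the saving is $1/12$, and any $\alpha < 1/4$ gives a strictly negative exponent.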

4.2. A technical generalization. For certain applications, we shall require a
slight generalization of Prop. 4.1, in which the role of $g \mapsto f_1(g) f_2(g a([\mathfrak{p}]))$ is
replaced by $g \mapsto F(g, g a([\mathfrak{p}]))$, where $F$ is a function on $X \times X$ that is not necessarily
of product type. Although the method of proof is identical to Prop. 4.1, the details
are slightly more technical; in particular, to state the result we will have need of
the adelic Sobolev norms discussed in Sec. 2.9.3. We shall also use the notion of
totally nondegenerate for functions on $X \times X$: see Sec. 2.4.

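Proposition 4.2. Suppose $F \in C^{\infty}(X \times X)$ is totally nondegenerate. Suppose
moreover that there is $b \in \mathbb{R}$ with

(4.14) $\prod_{\mathfrak{q} \in \mathrm{Supp}(F) \cup \mathrm{Supp}(\varphi)} N(\mathfrak{q}) \le N(\mathfrak{p})^{b}$

Let $\pi$ be a cuspidal representation of $\mathrm{PGL}_2$ over $F$, with conductor $\mathfrak{p}$, and put
$I(\varphi) = \int_X F(x, x a([\mathfrak{p}])) \varphi(x) \, dx$, for $\varphi \in \pi$. Then, for any $p > 4$, $d \gg 1$,

$|I(\varphi)| \ll_{b,\epsilon} N(\mathfrak{p})^{-\beta/p + \epsilon} \, \|\varphi\|_{L^2} \, S_{p,d,1/p}(F)$,
where $\beta = \dfrac{(1 - 2\alpha)(1 - 4\alpha)}{7 - 4\alpha}$.

With Kim's bound $\alpha = 3/26$, we obtain $\beta > 1/17$.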



$^{12}$ The argument that follows was improved by a suggestion of P. Michel.






Proof. The proof follows closely the proof of Prop. 4.1; the only difference is that
we apply (2.14) (proved in Lem. 9.8) in place of (4.4). Again, we may freely assume
that $\|\varphi\|_{L^2} = 1$; again we put $q = N(\mathfrak{p})$.

Let notations be as in the proof of Prop. 4.1; in particular, $\sigma$ is a signed real
measure on $\mathrm{PGL}_2(\mathbb{A}_{F,f})$ whose support commutes with $\mathrm{PGL}_2(F_{\mathfrak{p}})$, and $\Lambda \in \mathbb{C}$
satisfies $\varphi \star \sigma = \Lambda \varphi$. Proceeding as in that proof, and in particular as in (4.3), we
obtain:

(4.15) $|\Lambda I(\varphi)|^2 \le \int_{g,g'} \int_{x \in X} F\big((x,x)(g,g)(1,a([\mathfrak{p}]))\big) \, \overline{F\big((x,x)(g',g')(1,a([\mathfrak{p}]))\big)} \, dx \, d\sigma(g) \, d\sigma(g')$



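For any $g, g' \in \mathrm{PGL}_2(\mathbb{A}_{F,f})$, set $F_{g,g'}(x_1,x_2) = F((x_1,x_2)(g,g)) \, \overline{F((x_1,x_2)(g',g'))}$.
Then $F_{g,g'}$ is invariant by $\mathrm{PGL}_2(\mathfrak{o}_{F_{\mathfrak{p}}}) \times \mathrm{PGL}_2(\mathfrak{o}_{F_{\mathfrak{p}}})$ for $g, g' \in \mathrm{supp}(\sigma)$. By Hecke
equidistribution in the form of (2.14) (proved in Lem. 9.8) we see that for $p > 2$:

(4.16) $\left| \int_X F_{g,g'}\big((x,x)(1,a([\mathfrak{p}]))\big) \, dx - \sum_{\chi^2 = 1} \chi(\mathfrak{p}) \int_{X \times X} F_{g,g'}(x_1,x_2) \chi(x_1) \chi(x_2) \, dx_1 \, dx_2 \right| \ll_{\epsilon} q^{(2\alpha - 1)/p + \epsilon} \, S_{p,d}(F_{g,g'})$.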



Here, as before, $\chi(x)$ denotes the function on $X$ defined by $g \mapsto \chi(\det(g))$. By
definition $\left| \int_{X \times X} F_{g,g'}(x_1,x_2) \chi(x_1) \chi(x_2) \right| = \left| \langle (g'^{-1}g, g'^{-1}g) F, F \otimes (\chi,\chi) \rangle_{L^2(X \times X)} \right|$.

By the basic properties of adelic Sobolev norms ((2.12) and (2.13), proofs in
Lem. 8.1 and Lem. 8.2),

$S_{p,d}(F_{g,g'}) := S_{p,d,1/p}(F_{g,g'}) \le S_{2p,d,1/p}((g,g) \cdot F) \, S_{2p,d,1/p}((g',g') \cdot F) \le \|g\|^{2/p} \|g'\|^{2/p} S_{2p,d,1/p}(F)^2$.

Let us remark that the factors $\|g\|^{2/p}$ and $\|g'\|^{2/p}$ arise in the following way:
Lem. 8.2 actually gives a factor $\|(g,g)\|^{1/p}$, where the norm $\|\cdot\|$ (as in Sec. 2.4)
is computed in $\mathrm{PGL}_2(\mathbb{A}_{F,f})^2$; this equals $\|g\|^{2/p}$, where the norm is computed in
$\mathrm{PGL}_2(\mathbb{A}_{F,f})$.

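Choose $\sigma$ as in the proof of Prop. 4.1 (see esp. the paragraph before (4.7)) and
choose the coefficients $a_{\mathfrak{n}}$ as in that proof (see (4.11)). In particular, $\|\sigma\| \ll K^{2+\epsilon}$.
Since $F$ is totally nondegenerate, the matrix coefficients $\langle (g'^{-1}g, g'^{-1}g) F, F \rangle$ satisfy
bounds that are of the same quality as in the proof of Prop. 4.1; in particular, as
in (4.8):

$\sum_{\chi^2 = 1} \int_{g} \left| \langle (g,g) F, F \otimes (\chi,\chi) \rangle \right| \, d\mu_{\mathfrak{n}}(g) \ll_{\epsilon} q^{\epsilon} N(\mathfrak{n})^{2\alpha - 1/2 + \epsilon} \|F\|^2_{L^2(X \times X)}$

Finally $\|g\| \ll_{\epsilon} K^{2+\epsilon}$ for all $g \in \mathrm{supp}(\sigma)$. Proceeding just as in the previous
proof,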



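$|I(\varphi)| \ll_{\epsilon} (qK)^{\epsilon} \left( \dfrac{K^{4} q^{(2\alpha - 1)/p} K^{8/p} S_{2p,d,1/p}(F)^2 + K^{1+4\alpha} \|F\|^2_{L^2(X \times X)}}{K^{2}} \right)^{1/2}$

Consequently, for any $p > 2$,

$|I(\varphi)| \ll_{\epsilon} (qK)^{\epsilon} \, \dfrac{\left( K^{8} q^{(2\alpha - 1)/p} S_{2p,d,1/p}(F)^2 + K^{1+4\alpha} \|F\|^2_{L^2(X \times X)} \right)^{1/2}}{K}$

To conclude, choose $K = q^{\frac{1 - 2\alpha}{p(7 - 4\alpha)}}$ and replace $p$ by $p/2$ (thus, e.g., $p > 2$ becomes
$p > 4$). □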
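As a sanity check on the final optimization, the sketch below balances the two terms $K^{6} q^{(2\alpha-1)/p}$ and $K^{4\alpha-1}$ (an assumed reading of the last display, after dividing by $K^2$) and compares the outcome with an exponent of the form $\beta/p$, where $\beta = (1-2\alpha)(1-4\alpha)/(7-4\alpha)$. Both the reading of the display and the function names are assumptions made for illustration.

```python
from fractions import Fraction


def beta(alpha):
    """Candidate exponent beta = (1 - 2a)(1 - 4a)/(7 - 4a) (assumed reading)."""
    alpha = Fraction(alpha)
    return (1 - 2 * alpha) * (1 - 4 * alpha) / (7 - 4 * alpha)


def balance_prop_4_2(alpha, p):
    """Balance K^6 q^((2a-1)/p) against K^(4a-1), with K = q^kappa.

    Returns (kappa, exponent) with |I(phi)| << q^(exponent + eps),
    before the final substitution p -> p/2.
    """
    alpha, p = Fraction(alpha), Fraction(p)
    kappa = (1 - 2 * alpha) / (p * (7 - 4 * alpha))
    first = 6 * kappa + (2 * alpha - 1) / p     # q-exponent of the first term
    second = (4 * alpha - 1) * kappa            # q-exponent of the second term
    assert first == second
    return kappa, second / 2


kim = Fraction(3, 26)
print(beta(kim), beta(kim) > Fraction(1, 17))
print(balance_prop_4_2(kim, 2))
```

With $p$ replaced by $p/2$, the exponent $-\beta/(2p)$ returned here becomes $-\beta/p$, which matches the shape of the bound in the statement of the Proposition.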



5. Application to L-functions. 

We now present the first applications to subconvexity. The rough idea is simply
that certain L-functions are expressed as period integrals of the type that are
bounded by Prop. 4.1 and Prop. 4.2. There is one significant issue in implementing
this (rather evident) idea: namely, the integral representations that we use for
Rankin-Selberg and the standard L-functions involve Eisenstein series, which are
not in $L^2$; this causes problems in applying Prop. 4.1.

Thus we need to regularize. Two natural ways of doing this are to replace an
Eisenstein series by a "wave-packet", or to use a suitable form of truncation in the
defining integrals. In the present paper we will use the wave-packet technique; in
the paper [22] we shall also use truncation.

Let us briefly describe the wave packet technique in a classical language. Roughly
speaking we can express the Rankin-Selberg L-function of two classical forms $f, g$
via an integral of the form $L(s, f \times g) = \int f(z) g(z) E(s,z)$, for some Eisenstein
series $E(s)$. We now regularize, replacing $E(s,z)$ by a wave packet. Let $h(s)$ be
any holomorphic function: then

(5.1) $\int_{\Re(s) = 1/2} h(s) L(s, f \times g) \, ds = \int_{z} f(z) g(z) \int_{\Re(s) = 1/2} h(s) E(s,z)$.

We wish to eventually recover an upper bound for $L(1/2, f \times g)$ (say) from the
left-hand side, so we take $h(s) = L(1 - s, f \times g)$. Then $h(s) L(s, f \times g)$ is positive
along $\Re(s) = 1/2$. To apply Prop. 4.1 to the right-hand side of (5.1), we shall
moreover need to control the behavior of the regularized Eisenstein series $E_h =
\int_{\Re(s) = 1/2} h(s) E(s,z)$; this type of analysis is carried out in Sec. 10.2 and Sec. 10.3,
the main point being that the divergence of the Eisenstein series comes entirely
from the constant term.

It is worth remarking that Iwaniec's bounds for the L-function near 1 enter rather
crucially in this analysis: in effect, we bound $E_h$ by an easy argument involving
shift of contours; this necessitates that $h$ be estimated on a line $\Re(s) = -\epsilon$, which
amounts to estimating $L(s, f \times g)$ for $\Re(s) = 1 + \epsilon$.

In what follows we have not attempted to obtain polynomial dependence in all
parameters. This is not hard to do (it is, in essence, a statement that one
can find analytically suitable test vectors in a Rankin-Selberg integral), but we have
not done so here. On the other hand, we give full details of this procedure in the
proof of Thm. 6.1 (in which the polynomial dependence is particularly useful for
applications).

Theorem 5.1. Let $\pi_1, \pi_2$ be fixed automorphic cuspidal representations of $\mathrm{PGL}_2$
over $F$; fix $t \in \mathbb{R}$. Let $\pi$ be an automorphic cuspidal representation with conductor
$\mathfrak{p}$, a prime ideal that is prime to the conductors of $\pi_1$ and $\pi_2$.
Then, assuming Hypothesis 11.1,

(5.2) $L(\tfrac{1}{2}, \pi_1 \otimes \pi_2 \otimes \pi) \ll_{\pi_\infty} N(\mathfrak{p})^{1 - \frac{1}{13}}$
and, unconditionally:

(5.3) $|L(\tfrac{1}{2} + it, \pi_1 \otimes \pi)|^2 \ll_{\pi_\infty} N(\mathfrak{p})^{1 - \frac{1}{100}}$

(5.4) $|L(\tfrac{1}{2} + it, \pi)|^4 \ll_{\pi_\infty} N(\mathfrak{p})^{1 - \frac{1}{600}}$

In these statements, the notation $\ll_{\pi_\infty}$ indicates an implicit constant that depends
continuously on the local archimedean representation $\pi_\infty$ of $\mathrm{GL}_2(F_\infty)$ underlying
$\pi$.

Note we make no claim about the dependency of the implicit constant on $t, \pi_1, \pi_2$;
as remarked above, this dependence could be made polynomial in the conductors,
but this would require more careful analysis of the archimedean integrals.$^{13}$

We remark that we have used H. Kim's bound $\alpha = 3/26$; any value of $\alpha$ less
than 1/4 would give subconvexity, and under Ramanujan one obtains for (5.2) the
exponent 5/6. The exponents for (5.3) and (5.4) can be improved, e.g. the present
proof does not take into account the fact that unitary Eisenstein series satisfy
Ramanujan!
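The two numerical claims in the preceding remark (subconvexity for every $\alpha < 1/4$, and the exponent 5/6 under Ramanujan) can be checked against the period bound of Prop. 4.1. The sketch below is a hedged illustration: it assumes that the central value in (5.2) is bounded by $N(\mathfrak{p}) \cdot |I(\varphi)|^2$ up to $N(\mathfrak{p})^{\epsilon}$, so that the resulting exponent is $1 - 2\delta(\alpha)$ with $\delta(\alpha) = (1-2\alpha)(1-4\alpha)/(4(3-4\alpha))$. That relation and the function name are assumptions, not statements from the text.

```python
from fractions import Fraction


def triple_product_exponent(alpha):
    """Exponent e with L(1/2) << N(p)^(e + eps), ASSUMING L(1/2) << N(p) |I|^2.

    delta is the saving in |I(phi)| << N(p)^(-delta + eps) from Prop. 4.1.
    """
    alpha = Fraction(alpha)
    delta = (1 - 2 * alpha) * (1 - 4 * alpha) / (4 * (3 - 4 * alpha))
    return 1 - 2 * delta


print(triple_product_exponent(0))                  # Ramanujan case
print(triple_product_exponent(Fraction(3, 26)))    # Kim's bound
print(triple_product_exponent(Fraction(1, 4)))     # no saving at alpha = 1/4
```

Under this assumption the convexity exponent 1 is beaten exactly when $\alpha < 1/4$, and Kim's bound gives an exponent slightly below $1 - 1/13$.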

5.1. Results relating periods and integral representations. For the convenience
of the reader, we summarize here the results that relate periods and integral
representations (proved in later sections). Roughly speaking, any integral representation
for an L-function expresses it as a period integral with certain test vectors
belonging to the space of an automorphic cuspidal representation.

A delicate point, which is quite relevant to issues of polynomial dependence in
auxiliary parameters, is precisely which test vectors. In principle, the proofs of
results about integral representation give explicit test vectors. In practice, it is
tedious to extract these explicit test vectors. Our policy throughout this paper is
the lazy one: to deduce results, as far as possible, by formal arguments and without
choosing explicit test vectors. The price of this is that we will obtain not quite
the L-function, but rather a holomorphic function that differs from the L-function
by some harmless factors.

More precisely, the content of the Proposition (Prop. 5.1) that follows is that
one can write down an integral representation $I(s)$ for the L-functions of interest,
so that:

(1) $I(1/2)$ is not too much smaller than $L(1/2)$ (or with 1/2 replaced by the
point of interest), so that a bound for $I(1/2)$ gives a bound for $L(1/2)$.

(2) $I(s)$ is not too much bigger than $L(s)$ for any $s$. This type of control will
be useful in shifting contours.



$^{13}$ It is important to note, however, that this is an entirely local problem; it is intended that
this will be carried out in a more general context in [23]. Both for applications and to illustrate
procedure, we have carried out this type of analysis for the results on subconvexity of character
twists in Section 6. Those results are proved with polynomial dependence on all parameters.
One might prefer to get $I(s) = L(s)$ but we don't need this stronger statement.

As is discussed at length in Sec. 10, to a Schwarz function $\Psi$ on $\mathbb{A}_F^2$ is associated
a family of Eisenstein series $E_\Psi(s,g)$ on $X$, which varies meromorphically in the
parameter $s \in \mathbb{C}$.

Proposition 5.1. Let $s_0, t_0, t_0' \in \mathbb{C}$. Let $\pi_1$ be a fixed automorphic cuspidal
representation of $\mathrm{PGL}_2(\mathbb{A}_F)$ and $\pi$ an automorphic cuspidal representation of prime
conductor $\mathfrak{p}$; assume that the finite conductors of $\pi, \pi_1$ are coprime.

Denoting by $\pi_\infty$ the representation of $\mathrm{PGL}_2(F_\infty)$ corresponding to $\pi$, suppose
that $\mathrm{Cond}(\pi_\infty)$ is bounded above; equivalently, $\pi_\infty$ belongs to a bounded subset$^{14}$ of
the dual $\widehat{\mathrm{PGL}_2(F_\infty)}$ (in what follows the implicit constants may depend on these
bounds).

There exists a fixed finite set $\mathcal{F}$ of Schwarz-Bruhat functions on $\mathbb{A}_F^2$, and a real
number $C > 0$ so that:

There exist vectors $\varphi_1 \in \pi_1$, $\varphi \in \pi$ and $\Psi \in \mathcal{F}$ so that

$\Phi(s) := N(\mathfrak{p})^{1-s} \, \dfrac{\int_X \varphi(g) \varphi_1(g a([\mathfrak{p}])) E_\Psi(s,g) \, dg}{\Lambda(s, \pi_1 \otimes \pi)}$

is holomorphic and satisfies:

(1) $|\Phi(s_0)| \gg 1$ and $|\Phi(s)| \ll C^{|\Re(s)|} (1 + |s|)^{C}$;

(2) At any nonarchimedean place $v$ such that $\pi_1$ is unramified, $\Psi_v$ is invariant
by $\mathrm{PGL}_2(\mathfrak{o}_{F_v})$; at any nonarchimedean place $v$ such that $\pi_1$ and $\pi$ are both
unramified, $\varphi, \varphi_1$ are both invariant by $\mathrm{PGL}_2(\mathfrak{o}_{F_v})$.

(3) $\|\varphi_1\|_{L^\infty} \le 1$ and $\|\varphi\|_{L^2(X)} \ll_{\epsilon} N(\mathfrak{p})^{\epsilon}$.

Moreover, there exist vectors $\varphi \in \pi$, $\Psi_1, \Psi_2 \in \mathcal{F}$ so that:

(5.5) $\Phi(t,t') = N(\mathfrak{p})^{1/2 - t} \, \dfrac{\int_X \varphi(g) E_{\Psi_1}(g, \tfrac{1}{2} + t) E_{\Psi_2}(g a([\mathfrak{p}]), \tfrac{1}{2} + t') \, dg}{\Lambda(\tfrac{1}{2} + t + t', \pi) \Lambda(\tfrac{1}{2} + t - t', \pi)}$

is holomorphic and satisfies:

(1) $|\Phi(t_0, t_0')| \gg 1$ and $|\Phi(t,t')| \ll C^{|\Re(t)| + |\Re(t')|} (1 + |t| + |t'|)^{C}$.

(2) For any nonarchimedean place $v$, each of $\Psi_1$ and $\Psi_2$ is invariant by $\mathrm{PGL}_2(\mathfrak{o}_{F_v})$;
for each place at which $\pi$ is unramified, the same is true of $\varphi$.

(3) $\|\varphi\|_{L^2(X)} \ll_{\epsilon} N(\mathfrak{p})^{\epsilon}$.

Proof. Lem. 11.4 and Lem. 11.5. □
In effect, we could achieve "$\Phi = 1$" in Prop. 5.1 by a more careful choice of local
data; but this is irrelevant for the purpose of global estimation.

5.2. Proof of Thm. 5.1.

Proof. (of (5.2)). It follows from Hypothesis 11.1 and Proposition 4.1.

Proof. (of (5.3)). The basic idea is that the Rankin-Selberg convolution is a triple
product, with one factor being an Eisenstein series. However, one cannot naively
apply Prop. 4.1, since Eisenstein series do not belong to $L^4(X)$. To avoid this, we
will use a wave-packet of Eisenstein series.

First, we can assume from the start that $\pi_\infty$ belongs to a bounded subset of the
dual $\widehat{\mathrm{PGL}_2(F_\infty)}$. The implicit constants in the proof that follows depend on this
subset. We denote by $\Lambda$ the completed L-function. We begin by remarking that



14 See Section TTTFSi for definition 
since $N(\mathfrak{p}) \to \infty$ we may assume that $\pi_1$ is not isomorphic to $\pi$, or to any quadratic
twist of $\pi$. In particular, we are free to assume that $\Lambda(s, \pi_1 \otimes \pi)$ has no poles.
Moreover, the finite conductor of $\Lambda(s, \pi_1 \otimes \pi)$ differs from $N(\mathfrak{p})^2$ by an absolutely
bounded constant.

Fixing $t_0 \in \mathbb{R}$, let $\Psi, \varphi, \varphi_1, E_\Psi(g,s), \Phi$ be as in Prop. 5.1 with $s_0 = 1/2 + it_0$,
so that $|\Phi(1/2 + it_0)| \gg 1$. For simplicity we write simply $E(g,s)$ for $E_\Psi(g,s)$.
Fix $\kappa > 0$. In the rest of the proof we omit the subscript $\kappa, \epsilon$ from $\ll$, with the
understanding that all implicit constants depend on $\kappa$ and $\epsilon$. Put

$I(s) = N(\mathfrak{p})^{s-1} \Lambda(s, \pi_1 \otimes \pi) \Phi(s) = \int_X \varphi_1(g a([\mathfrak{p}])) E(s,g) \varphi(g) \, dg$

From Iwaniec's upper bounds for L-functions [15, Chapter 8], the functional equation
for $\Lambda$, and the bounds on $\Phi$ furnished by Prop. 5.1,

(5.6) $|I(1 + \kappa + it)| \ll (1 + |t|)^{-4} N(\mathfrak{p})^{\kappa + \epsilon}$, $\quad |I(-\kappa + it)| \ll (1 + |t|)^{-4} N(\mathfrak{p})^{\kappa + \epsilon}$.

Put $h(s) = s(1 - s)(s - \tfrac{1}{2})^2 I(1 - s)$. Then $h(s)$ is holomorphic in $-\kappa \le \Re(s) \le
1 + \kappa$ and $h(\tfrac{1}{2}) = 0$.$^{15}$ Moreover $h(s)$ has rapid decay as $|\Im(s)| \to \infty$, in view of
the $\Gamma$-factors of the completed L-function. Put $E_h(g) = \int_{\Re(s) = 1 + \kappa} h(s) E(s,g)$. It is
proved in Lem. ITHlfl that. for such /i, ||_B ft (p)||ic» < + + + k + it)\)dt. 

Applying we conclude that \\E h {g)\\ L ^ < N(p) K+e . 

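On the other hand, we see from the definition of $I(s)$ that

(5.7) $\int_{\Re(s) = 1 + \kappa} h(s) I(s) = \int_{\Re(s) = 1 + \kappa} h(s) \, ds \int_X \varphi_1(g a([\mathfrak{p}])) E(s,g) \varphi(g) \, dg$.

The double integral on the right hand side of (5.7) is absolutely convergent and
orders may be switched; thus

$\int_{\Re(s) = 1 + \kappa} h(s) I(s) = \int_X \varphi_1(g a([\mathfrak{p}])) E_h(g) \varphi(g) \, dg = \int_X \varphi_1(g a([\mathfrak{p}])) E_h^0(g) \varphi(g) \, dg$,

where $E_h^0 := \mathscr{P}(E_h)$ is totally nondegenerate (see Section 2.7) and satisfies a bound $\ll_{\epsilon}
N(\mathfrak{p})^{\kappa + \epsilon}$ of the same shape as that for $E_h$.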

We then deduce from Prop. 4.1 that:



(5.8) 



h(s)I(s) 

K(s)=l+K 



< IMU*(x)N(p) 



$I(s)$ and $h(s)$ both decay exponentially rapidly as $|\Im(s)| \to \infty$. It is therefore
simple to justify shifting the line of integration in (5.8) to $\Re(s) = 1/2$. We deduce
thereby that



(5.9) 



/ t 2 \I(l+it)fdt 



< ||Vi|U*(x)N(p) 



$^{15}$ The fact that we impose $h(1/2) = 0$ has a very concrete meaning in classical terms. Fix,
for example, a form $f$ and $t \in \mathbb{R}$. Consider $\sum_{g} |L(\tfrac{1}{2} + it, f \otimes g)|^2$, where the sum is taken over a
basis of holomorphic Hecke eigenforms of level $N$ and trivial Nebentypus. If $t = 0$, this has the
asymptotic behaviour $N \log(N)^3$. On the other hand, if $t \neq 0$, it behaves like $N \log(N)$. Forcing
$h(1/2) = 0$ "counteracts" this extra singularity.
From (5.6) we deduce bounds on $I$ and $I'$ inside the strip $0 \le \Re(s) \le 1$ by the
maximal modulus principle. In particular,

(5.10) $|I'(\tfrac{1}{2} + it)| \ll_{t} N(\mathfrak{p})^{\kappa + \epsilon}$.
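Combining (5.9) and (5.10), and recalling that $\kappa$ is arbitrary, we obtain $|I(1/2 +
it)| \ll_{t} N(\mathfrak{p})^{-\frac{1}{200} + \epsilon}$. Thus $|\Lambda(\tfrac{1}{2} + it_0, \pi_1 \otimes \pi)| \ll_{\epsilon,t} N(\mathfrak{p})^{\frac{1}{2} - \frac{1}{200} + \epsilon}$. □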

Proof. (of (5.4).) The proof is similar to that of (5.3), but a slightly more elaborate
regularization is required, since we shall proceed from the expression (5.5) of $L(s, \pi)$
as a triple product against two Eisenstein series. Again we may assume from the
start that $\pi_\infty$ is confined to a bounded subset of $\widehat{\mathrm{GL}_2(F_\infty)}$; the implicit constants
will, again, depend on this subset.

Let $\Lambda(s, \pi)$ be the completed L-function attached to $\pi$. Fixing $t_0, t_0' \in i\mathbb{R}$, Prop.
5.1 gives the existence of Eisenstein series $E_{\Psi_1}(g,s) = E_1(g,s)$, $E_{\Psi_2}(g,s) = E_2(g,s)$
on $X$, and $\varphi \in \pi$ so that



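$\Phi(t,t') := N(\mathfrak{p})^{1/2 - t} \, \dfrac{\int_X \varphi(g) E_1(g, \tfrac{1}{2} + t) E_2(g a([\mathfrak{p}]), \tfrac{1}{2} + t') \, dg}{\Lambda(\tfrac{1}{2} + t + t', \pi) \Lambda(\tfrac{1}{2} + t - t', \pi)}$

satisfies $|\Phi(t_0, t_0')| \gg 1$ and $\Phi(t,t') \ll C^{|\Re(t)| + |\Re(t')|} (1 + |t| + |t'|)^{C}$.
We put

(5.11) $I(z_1, z_2) = \Phi(z_1, z_2) N(\mathfrak{p})^{z_1 - 1/2} \Lambda(\tfrac{1}{2} + z_1 + z_2, \pi) \Lambda(\tfrac{1}{2} + z_1 - z_2, \pi)$
$\qquad = \Phi(z_1, z_2) \, N(\mathfrak{p})^{\frac{z_1 + z_2}{2} - \frac{1}{4}} \Lambda(\tfrac{1}{2} + z_1 + z_2, \pi) \, N(\mathfrak{p})^{\frac{z_1 - z_2}{2} - \frac{1}{4}} \Lambda(\tfrac{1}{2} + z_1 - z_2, \pi)$

Then $I(z_1, z_2)$ is a holomorphic function of $(z_1, z_2) \in \mathbb{C}^2$. $I(z_1, z_2)$ has rapid decay
along "vertical lines", that is, for $\sigma, \sigma'$ in a fixed compact set and $(t,t') \in \mathbb{R}^2$ we
have $I(\sigma + it, \sigma' + it') \ll_{N} (1 + |t| + |t'|)^{-N}$.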

Let $\kappa > 0$ be fixed. From (5.11), Iwaniec's bounds for L-functions near 1, and
the rapid decay of $I$ along "vertical lines," we obtain by the maximal modulus
principle:

$(1 + |z_1| + |z_2|)^{N} \max\big( |I(z_1, z_2)|, |\partial_1 I(z_1, z_2)|, |\partial_2 I(z_1, z_2)| \big) \ll_{N} N(\mathfrak{p})^{\kappa}$, $\quad |\Re(z_1)| + |\Re(z_2)| \le 1/2 + \kappa$,

where $\partial_1$ (resp. $\partial_2$) is the operator of differentiation w.r.t. $z_1$ (resp. $z_2$). Put

(5.12) $h(z_1, z_2) = z_1^2 z_2^2 (1/4 - z_1^2)(1/4 - z_2^2) \, \overline{I(-\bar{z}_1, -\bar{z}_2)}$.

Then, in the notation of Sec. 10.3 (esp. Def. 10.1), $h$ belongs to the space $H^{(2)}(\kappa)$
and satisfies $\|h\|_{N} \ll_{N} N(\mathfrak{p})^{\kappa}$.

Put $I = \int_{\Re(z_1) = \Re(z_2) = 0} h(z_1, z_2) I(z_1, z_2) \, dz_1 \, dz_2$. Then:

(5.13) $I = \int_{\Re(z_1) = 0, \Re(z_2) = 0} h(z_1, z_2) \, dz_1 \, dz_2 \int_X \varphi(g) E_1(g, 1/2 + z_1) E_2(g a([\mathfrak{p}]), 1/2 + z_2) \, dg$
$\qquad = \int_X \varphi(x) E_h\big((x,x)(1, a([\mathfrak{p}]))\big) \, dx$

where the function $E_h$ on $X \times X$ is defined by

$E_h(g_1, g_2) = \int_{\Re(z_1) = 0, \Re(z_2) = 0} h(z_1, z_2) E_1(g_1, 1/2 + z_1) E_2(g_2, 1/2 + z_2)$,
and the interchange of orders is justified by the (easily verified) absolute convergence
of the double integral defining $I$. Note that $E_h(g_1, g_2)$ is totally nondegenerate (see
Section 2.7 for definition). We now apply Prop. 4.2 to conclude that $|I| \ll_{p,d}
S_{p,d,2/p}(E_h) \|\varphi\|_{L^2} N(\mathfrak{p})^{-\frac{1}{17p}}$ for any $p > 4$, $d \gg 1$. We note at this point that the
requirement $p > 4$ makes it critical that the regularized Eisenstein series $E_h$ belong
to $L^4$; the trivial fact that Eisenstein series belong to $L^{2-\epsilon}$ is far from sufficient.

On the other hand, by Lem. 10.9, $S_{p,d,2/p}(E_h) \ll \|h\|_{N}$ for some $N$, possibly
depending on $p, d$. By Prop. 5.1, $\|\varphi\|_{L^2} \ll N(\mathfrak{p})^{\epsilon}$.

Thus $|I| \ll_{\epsilon} N(\mathfrak{p})^{\epsilon - 1/68}$. Now, by the definition of $h$ (5.12) we have $I =
N(\mathfrak{p})^{-1} \int_{(t_1,t_2) \in \mathbb{R}^2} (1/4 + t_1^2)(1/4 + t_2^2) \, t_1^2 t_2^2 \, |I(it_1, it_2)|^2 \, dt_1 \, dt_2$. Thus we obtain:

(5.14) $\int_{(t_1,t_2) \in \mathbb{R}^2} |I(it_1, it_2)|^2 \, t_1^2 t_2^2 \, dt_1 \, dt_2 \ll_{\epsilon} N(\mathfrak{p})^{1 - 1/68 + \epsilon}$.

Using (5.14), and the given properties of $\Phi$, we deduce that $|\Lambda(\tfrac{1}{2} + t_0 + t_0', \pi) \Lambda(\tfrac{1}{2} +
t_0 - t_0', \pi)|^2 \ll_{t_0,t_0'} N(\mathfrak{p})^{1 - 1/600}$ in a similar fashion to the conclusion of the proof of
(5.3). We take $t_0' = 0$ to conclude. □

6. Torus periods (I): subconvex bounds for character twists over a number field.

In this section we shall work in considerable generality; we shall derive subconvex
bounds without any assumptions of prime or squarefree conductor, and we obtain
polynomial dependence in all auxiliary parameters. This is useful for applications,
but will involve some notational overhead. As a result, we have sacrificed good
exponents for simplicity at many steps.

Theorem 6.1. Let $\pi$ be a (unitary) cuspidal representation of $\mathrm{GL}_2(\mathbb{A}_F)$, and $\chi$ a
unitary character of $\mathbb{A}_F^\times / F^\times$, with finite conductor $\mathfrak{f}$. Then there is $N > 0$ such
that

(6.1) L(i tt x x) « CondW A 'Cond 00 ( X ) JV N(f) 1 /2-^, 

(6.2) L{ 1 -, x) « Cond 00 ( X ) Ar N(f) 1 /4-^o 

Note that the result also implies a corresponding statement for the L-functions
evaluated at $\tfrac{1}{2} + it$, since one may replace $\chi$ by $\chi |\cdot|^{it}$.

Since it is perhaps hidden in the proof where the polynomial dependence on
conductor arises, we would like to explicate it now. If $\pi$ is an automorphic cuspidal
representation with analytic conductor $\mathrm{Cond}(\pi)$, there exists a vector $\varphi \in \pi$ with
Sobolev norms $S_{2,d,\beta}(\varphi) \ll \mathrm{Cond}(\pi)^{\mathrm{const} \cdot \max(\beta,d)}$. Moreover, one can choose such
a $\varphi$ to be a "good" test vector w.r.t. certain toral periods. Thus the analytic
conductor enters precisely through the minimal Sobolev norm of a suitable vector
belonging to the space of $\pi$. We note that the test vectors we choose are smooth but
not $K$-finite at infinite places; this idea has been heavily exploited in the previous
work of Bernstein and Reznikov.

Note that some cases of Thm. 6.1 (where $\pi$ has trivial central character and
$\chi$ is quadratic) are subsumed by the previous result Thm. 5.1. Nevertheless, we
have chosen to give a distinct presentation since the method is entirely different; moreover, it
is simpler in the present method to deal with the case of noncuspidal $\pi$. Also, we
shall consistently deal in the present section with GL(2), rather than $\mathrm{PGL}_2$. Thus
$\omega$ will be a unitary character of $\mathbb{A}_F^\times / F^\times$, and $C^\infty_\omega(X_{GL(2)})$ the space of functions
on $X_{GL(2)} = \mathrm{GL}_2(F) \backslash \mathrm{GL}_2(\mathbb{A}_F)$ with central character $\omega$.

In the case $F = \mathbb{Q}$, the subconvexity result (6.2) for characters is due to Burgess.
Burgess' method gives a much better exponent; of course there is considerable scope
for improvement in the present technique also.

For the ease of the reader, we briefly explain in advance the points of our proof in
classical language. The discussion that follows is not a completely faithful rendition
of the proof, but it hopefully conveys the main ideas. While it follows the pattern
of all the proofs in this paper, one minor complication is that we deal with integrals
w.r.t. certain measures of infinite mass.

(1) If $f$ is a Maass form on $\mathrm{SL}_2(\mathbb{Z}) \backslash \mathbb{H}$, the integral over $y$ of the twisted sums
$\sum_{x} \chi(x) f(\tfrac{x}{q} + iy)$ equals, up to some $\Gamma$-factors, $\Lambda = L(\tfrac{1}{2}, f \times \chi)$. This is
an exercise in Hecke-Jacquet-Langlands theory. The version of this equality that
we shall use is proved in Lem. 11.8 when $f$ is a cusp form and Lem. 11.10 when $f$ is
Eisenstein.

(2) It will then suffice to bound $\sum_{x=1}^{q-1} \chi(x) f(\tfrac{x}{q} + iy)$ for each fixed value of $y$.
As it turns out, the crucial range of $y$ is around $y = q^{-1}$; the contribution of
other $y$'s is small for relatively trivial reasons (use the Fourier expansion).
This is roughly a geometric form of the approximate functional equation:
it says that the Fourier coefficients $a_n(f)$ with $n \asymp q$ are most important
to determining the L-function. The general version of this fact is proven in
Lem. ITOl

(3) In the range when $y \asymp q^{-1}$, the set $\{\tfrac{x}{q} + iy\}_{1 \le x \le q-1}$ is roughly
equidistributed, because it is (with the exception of two points) the orbit of $iqy \in \mathbb{H}$
by the $q$th Hecke operator. This is easy to quantify and actually can be
regarded as a statement about equidistribution of $p$-adic horocycles. The
general version of this is proved in Lem. 9.10.

(4) We are now in a situation where we are trying to bound the period of $f$
along the roughly equidistributed set $\{\tfrac{x}{q} + iy\}_{1 \le x \le q-1}$. To do this, we
apply mixing properties of the adelic torus flow, in the same fashion as the
previous proofs of this paper. This shows that $\sum_{x=1}^{q-1} \chi(x) f(\tfrac{x}{q} + iy)$ is small.

The computations that underlie steps (1), (2) and (3) are fairly routine but
technically complicated. We have therefore carried them out in Sec. 11.4. In the
sections that follow, we merely quote the results and carry out what amounts to
step (4).

6.1. Relating integral representations and periods. Let $z \in \mathbb{R}$ and let $\mu_z, \nu_z, \mu, \nu$
be the measures on $X_{GL(2)}$ defined by

(6.3)
In both cases, the measure $d^\times y$ is the probability measure invariant by $\mathbb{A}_F^\times / F^\times$ and
the measure $d^\times z$ is a Haar measure on $\mathbb{R}^\times$. Thus $\mu_z, \nu_z$ are probability measures,
whereas $\mu, \nu$ have infinite mass. It is simple to see that the integrals defining
$\mu(f), \nu(f)$ converge absolutely if $f$ is a function decaying rapidly enough at the
cusps, e.g. satisfying $f(x) \ll \mathrm{ht}(x)^{-\epsilon}$ (notation of Sec. 8.2), for any $\epsilon > 0$.

Note also the analogy between these measures and those used in the analysis of
unipotent periods (cf. (3.2)). Classically, $\nu_z$ should be thought of as the measure
on $\mathrm{SL}_2(\mathbb{Z}) \backslash \mathbb{H}$ defined by $f \mapsto \sum_{0 \le x \le q-1} f(\tfrac{x}{q} + iz)$, and $\mu_z$ as the measure on $\mathrm{SL}_2(\mathbb{Z}) \backslash \mathbb{H}$
defined by $f \mapsto \sum_{0 \le x \le q-1} f(\tfrac{x}{q} + iz) \chi(x)$. (These statements are not to be interpreted
precisely; they are for intuition only.)
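In this classical picture, the twisted sum annihilates constant functions as soon as $\chi$ is nontrivial, which is the mechanism later used to kill the constant term of an Eisenstein series. The toy sketch below illustrates this for a quadratic character modulo an odd prime $q$, realized as the Legendre symbol. It is an illustration only: the helper names are hypothetical, and nothing here models the adelic measures themselves.

```python
def legendre(x, q):
    """Legendre symbol (x/q) for an odd prime q, via Euler's criterion."""
    r = pow(x, (q - 1) // 2, q)       # r is 0, 1 or q - 1
    return -1 if r == q - 1 else r


def twisted_sum(values, q):
    """Toy analogue of the classical mu_z: sum over x mod q of chi(x) * values[x]."""
    return sum(legendre(x, q) * values[x] for x in range(q))


q = 13
constant = [1] * q                    # a constant "function" sampled at x/q + iz
print(twisted_sum(constant, q))       # the nontrivial character kills constants
```

The untwisted sum of the same samples equals $q$, so the cancellation comes entirely from the character.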

Here is the Proposition that formalizes (1) and (2) of the discussion above, in
the cuspidal case.

Proposition 6.1. Let $\pi$ be a cuspidal representation on $\mathrm{GL}_2(\mathbb{A}_F)$; $\chi$ a character
of $\mathbb{A}_F^\times / F^\times$ of finite conductor $\mathfrak{f}$. Write $L^{unr}(s, \pi \times \chi) = \prod_{\chi_v \text{ unram.}} L_v(s, \pi \times \chi)$,
where the product is taken over all finite places at which $\chi$ is not ramified.

Let $d, \beta \ge 0$. Let $g_+, g_-$ be positive smooth functions on $\mathbb{R}_{>0}$ such that $g_+ + g_- =
1$, $g_+(t) = 1$ for $t \ge 2$ and $g_-(t) = 1$ for all $t \le 1/2$.

Then there exists $\varphi \in \pi$ such that, with

( "> *" ) = W( "' / "^(..x»ri 

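then $\Phi(s)$ is holomorphic and satisfies:

(1) $|\Phi(s)| \ll_{\Re(s),\epsilon} N(\mathfrak{f})^{\epsilon}$ and $|\Phi(\tfrac{1}{2})| \gg_{\epsilon} N(\mathfrak{f})^{-\epsilon}$.

(2) $\varphi$ is new at every finite place (i.e., for each finite prime $\mathfrak{q}$ it is invariant by
$K_0[\mathfrak{q}^{s_{\mathfrak{q}}}]$, where $s_{\mathfrak{q}}$ is the local conductor of $\pi$).

(3) The Sobolev norms of $\varphi$ satisfy the bounds

(6.6) $S_{2,d,\beta}(\varphi) \ll_{\epsilon} \mathrm{Cond}_\infty(\pi)^{2d + \epsilon} \, \mathrm{Cond}_f(\pi)^{\beta + \epsilon} \, \mathrm{Cond}_\infty(\chi)^{1/2 + 2d}$

(4) The integration of (6.5) may be "truncated without significant change" to
the region $z$ around $N(\mathfrak{f})^{-1}$; more formally:

$\left| \int \mu_z(\varphi) g_+(z/T) \, d^\times z \right| \ll (N(\mathfrak{f}) T)^{-1/2} \, (T \, \mathrm{Cond}(\pi) \mathrm{Cond}(\chi))^{\epsilon}$

$\left| \int \mu_z(\varphi) g_-(z/T) \, d^\times z \right| \ll (N(\mathfrak{f}) T)^{1/2} \, (T \, \mathrm{Cond}(\chi))^{\epsilon} \, (\mathrm{Cond}_\infty(\chi) \mathrm{Cond}(\pi))^{1 + \epsilon}$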



Proof. Lem. ITOI and Lem. HOI □ 
We next give the corresponding result for the "noncuspidal case." We recall
that the Eisenstein series $E_\Psi(s,g)$ associated to a Schwarz function $\Psi$ on $\mathbb{A}_F^2$ are
discussed in Sec. 10. The normalization is so that the functional equation
interchanges $s$ and $1 - s$. $\bar{E}(s,g)$ denotes, as explained in that section (cf. (10.9)),
the truncated Eisenstein series obtained by subtracting the constant term; it is a
function on $B(F) \backslash \mathrm{GL}_2(\mathbb{A}_F)$.

Proposition 6.2. Let $s_0, s_0' \in \mathbb{C}$, and suppose that $\chi$ is ramified at at least one
finite place. Let $g_\pm$ be as in Prop. 6.1. There is an absolute $C > 0$ (i.e. depending
only on $F$) and a choice of $K_{\max}$-invariant Schwarz function $\Psi$ (depending on $\chi$)
so that if we put

( ' j '" W L( X ,s + S >)L( X) l-s + s>) 

where $\bar{E}$ is defined as in (10.9), then the integral defining $\Phi$ is absolutely convergent
when $\Re(s), \Re(s') \gg 1$. Moreover, $\Phi$ extends from $\Re(s), \Re(s') \gg 1$ to a holomorphic
function on $\mathbb{C}^2$, satisfying

(1) $|\Phi(s_0, s_0')| \gg 1$ and $|\Phi(s, s')| \ll C^{1 + |\Re(s)| + |\Re(s')|} (1 + |s| + |s'|)^{C}$.

Moreover, given $N \ge 0$ we have that

(6.7) $|\Phi(s, s')| (1 + |s| + |s'|)^{N} \ll_{\Re(s),\Re(s')} \mathrm{Cond}_\infty(\chi)^{N'}$

where $N'$ and the implicit constant may be taken to depend continuously on
$N, \Re(s), \Re(s')$.

(2) $\Psi$, and so also $E_\Psi(s,g)$, is invariant by $K_{\max}$;

(3) Let $h \in H(\kappa)$ be as in (10.18), and put $E_h := \int_{\Re(s) \gg 1} h(s) E_\Psi(s,g) \, ds$.
Then, for each $d, \beta$, there is $N > 0$ such that $S_{\infty,d,\beta}(E_h) \ll_{\kappa} \|h\|_{N} \mathrm{Cond}_\infty(\chi)^{N}$,
where the norm $\|h\|_{N}$ is defined in (10.18).

(4) We have $\mu_z(E_h) \ll_{K,\chi,h} \min(|z|^{K}, |z|^{-K})$ for each$^{16}$ $K \ge 1$. Moreover,
there is $N > 0$ such that

$\left| \int \mu_z(E_h) g_+(z/T) \, d^\times z \right| \ll (N(\mathfrak{f}) T)^{-1/2} \, (T \, \mathrm{Cond}(\chi))^{\epsilon} \, \|h\|_{N}$

$\left| \int \mu_z(E_h) g_-(z/T) \, d^\times z \right| \ll (N(\mathfrak{f}) T)^{1/2} \, (T \, \mathrm{Cond}(\chi))^{\epsilon} \, \mathrm{Cond}_\infty(\chi)^{1 + \epsilon} \, \|h\|_{N}$

Proof. Lem. 11.10 and Lem. 11.11. □

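6.2. Proof of theorem: cuspidal case. Let $\chi$ be a character of $\mathbb{A}_F^\times / F^\times$, of
varying conductor $\mathfrak{f}$. Put $q = N(\mathfrak{f})$.

We shall need the following estimate, proved in Lem. 9.10. It amounts in essence
to a statement about the equidistribution of $p$-adic horocycles (classically, these
roughly correspond to a statement about the equidistribution of $\{\tfrac{x}{q} + iz\}_{0 \le x \le q-1}$,
if $z \asymp q^{-1}$).

For any function $f \in C^\infty_\omega(X)$ that is invariant by $K_0[\mathfrak{q}^{s_{\mathfrak{q}}}]$, we have (with $\mathfrak{m} = \prod_{\mathfrak{q}} \mathfrak{q}^{s_{\mathfrak{q}}}$)

(6.8) $\left| \nu_z(f) - \int_X f \right| \ll_{\epsilon} q^{\alpha - 1/2 + \epsilon} \, N(\mathfrak{m})^{3/2 + \epsilon} \, \max\{qz, \tfrac{1}{qz}\}^{1/2} \, S_{2,d}(f)$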



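Proof. (of Thm. 6.1, cuspidal case.) Choose $f \in \pi$ to be the "$\varphi$" of Prop. 6.1, so
that $|\mu(f)| \gg_{\epsilon} N(\mathfrak{f})^{-1/2 - \epsilon} |L^{unr}(1/2, \pi \times \chi)|$. For each ramified prime $\mathfrak{q}$ of $\pi$, let $\mathfrak{q}^{s_{\mathfrak{q}}}$
be the local conductor of the local representation $\pi_{\mathfrak{q}}$. Set $\mathfrak{m} := \prod_{\mathfrak{q}} \mathfrak{q}^{s_{\mathfrak{q}}}$, the finite
conductor of $\pi$.

Let $K \ge 1$ be an integer satisfying $K < N(\mathfrak{f})$. Let $S$ be the set of prime ideals $\mathfrak{n}$
of $\mathfrak{o}_F$, with norm lying in $[K, 2K]$, and satisfying $(\mathfrak{n}, \mathfrak{f}) = 1$ and $(\mathfrak{n}, \mathrm{Supp}(f)) = 1$.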



$^{16}$ The implicit constants here are totally unimportant; this estimate will be used only to verify
that certain integrals converge.
Fix $\mathfrak{n}_0 \in S$. For each prime ideal $\mathfrak{n} \in S$, let $\varpi_{\mathfrak{n}} \in F_{\mathfrak{n}}$ be a uniformizer. We define a
measure $\sigma$ on $\mathrm{GL}_2(\mathbb{A}_F)$ so that



(6.9) 



nes 



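Clearly $\sigma$ has total mass 1. Moreover, $\mu(f) = \mu(f \star \sigma)$ and $f \star \sigma$ is invariant by
$K_0[\mathfrak{q}^{s_{\mathfrak{q}}}]$ for each $\mathfrak{q} | \mathfrak{m}$.

Choose $\kappa$ "slightly smaller than 1," to be specified later. Our aim is now to cut
the $z$ integration in $\mu = \int_z \mu_z \, d^\times z$ into three ranges, the crucial range of which will be
$q^{-2+\kappa} \le z \le q^{-\kappa}$, avoiding the pain of dealing with the infinite mass measure $\mu$.
Let $g_+, g_-$ be as in Prop. 6.1. Define $h(t)$ by the rule $g_-(\tfrac{t}{q^{-2+\kappa}}) + h(t) + g_+(\tfrac{t}{q^{-\kappa}}) = 1$.

(6.10) $|\mu(f)|^2 = |\mu(f \star \sigma)|^2 \ll \left| \int g_-\!\left( \tfrac{z}{q^{-2+\kappa}} \right) \mu_z(f \star \sigma) \, d^\times z \right|^2 + \left| \int h(z) \mu_z(f \star \sigma) \, d^\times z \right|^2 + \left| \int g_+\!\left( \tfrac{z}{q^{-\kappa}} \right) \mu_z(f \star \sigma) \, d^\times z \right|^2$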



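By Prop. 6.1 the first and last term (without the square, e.g. $\int g_+(\tfrac{z}{q^{-\kappa}}) \mu_z(f \star \sigma) \, d^\times z$)
are $\ll \mathrm{Cond}(\pi)^{1+\epsilon} \mathrm{Cond}_\infty(\chi)^{1+\epsilon} q^{\frac{\kappa - 1}{2} + \epsilon}$. More explicitly, we note that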

*CT) = IST 1 ^2 / i N(n)- 1 N(n ) 2 (/) 



(6.11) 



Now $1/2 \le N(\mathfrak{n}) N(\mathfrak{n}_0)^{-1} \le 2$ for all $\mathfrak{n}$ (this was the purpose of the factors involving
$\mathfrak{n}_0$ in (6.9)), and so one easily deduces a bound on $\int g_-(\tfrac{z}{q^{-2+\kappa}}) \mu_z(f \star \sigma) \, d^\times z$ from
the final assertion of Prop. 6.1. Similarly for the term involving $g_+$.

(That the first and last terms of (6.10) should be less significant may be seen in
the classical setting from the Fourier expansion; it should be regarded as a geometric
version of the approximate functional equation.)
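The three-way cut of the $z$-integration is a smooth partition of unity on the multiplicative line. The following sketch is only an illustration, with one particular (assumed) choice of the cutoffs $g_\pm$ built from the standard bump $e^{-1/x}$, and with hypothetical function names. It builds $h = 1 - g_-(\cdot/q^{-2+\kappa}) - g_+(\cdot/q^{-\kappa})$ and confirms that $h$ equals 1 at the critical point $z = q^{-1}$ and vanishes well outside $[q^{-2+\kappa}, q^{-\kappa}]$.

```python
import math


def _bump(x):
    """exp(-1/x) for x > 0 and 0 otherwise: the usual smooth building block."""
    return math.exp(-1.0 / x) if x > 0 else 0.0


def g_plus(t):
    """One possible smooth cutoff: 0 for t <= 1/2, 1 for t >= 2."""
    u = (t - 0.5) / 1.5
    return _bump(u) / (_bump(u) + _bump(1.0 - u))


def g_minus(t):
    """Complementary cutoff, so that g_plus + g_minus = 1."""
    return 1.0 - g_plus(t)


def middle_cutoff(t, q, kappa):
    """h(t) defined by g_minus(t / q^(-2+kappa)) + h(t) + g_plus(t / q^(-kappa)) = 1."""
    lower = q ** (-2.0 + kappa)
    upper = q ** (-kappa)
    return 1.0 - g_minus(t / lower) - g_plus(t / upper)


q, kappa = 1e6, 0.9
print(middle_cutoff(1.0 / q, q, kappa))                    # critical range z ~ 1/q
print(middle_cutoff(0.25 * q ** (-2.0 + kappa), q, kappa)) # far below the range
print(middle_cutoff(4.0 * q ** (-kappa), q, kappa))        # far above the range
```

Since $h$ is supported in $[\tfrac{1}{2} q^{-2+\kappa}, 2 q^{-\kappa}]$, its total mass against $d^\times z$ is of size $\log q$, which is the fact used when Cauchy-Schwarz is applied to the middle term.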

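As for the intermediate term, we note

$\int_z h(z) \mu_z(f) \, d^\times z = \int_{y \in \mathbb{A}_F^\times / F^\times} h(|y|) \chi(y) f(a(y) n([\mathfrak{f}])) \, d^\times y$.

Applying Cauchy-Schwarz, and the fact that $\int_{\mathbb{A}_F^\times / F^\times} h(|y|) \, d^\times y \ll \log(q)$, we get:

(6.12) $\left| \int h(z) \mu_z(f \star \sigma) \, d^\times z \right|^2 \ll_{\epsilon} q^{\epsilon} \int_z h(z) \nu_z(|f \star \sigma|^2) \, d^\times z$
$\qquad \ll_{\epsilon} q^{\epsilon} \int_X |f \star \sigma|^2 + q^{\alpha - \kappa/2 + \epsilon} N(\mathfrak{m})^{3/2 + \epsilon} S_{2,d}(|f \star \sigma|^2)$,

where we have applied (6.8). By Lemmas 8.1 and 8.2,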

(6.13) S 2 , di/3 (|/*a| 2 ) « S 4 . A(3 (f*a) 2 

« (su Pgesupp(CT) || 5 ||) 2 ^ 4 ,^(/) 2 « K^S w+3/2 (f) 2 . 

where the last line holds for $d' \gg d$, and we have used Lem. 9.3 (which bounds the
$L^\infty$ norm of a cusp form in terms of $L^2$ norms), together with the easily verified
fact that $\sup_{g \in \mathrm{supp}(\sigma)} \|g\| \le K^2$.






By bounds towards Ramanujan,

(6.14) $\|f \star \sigma\|^2_{L^2} = \int_{g,g'} \langle g^{-1} g' f, f \rangle \, d\sigma(g) \, d\sigma(g') \ll K^{2\alpha - 1} \|f\|^2_{L^2}$.

Thus: 

(6.15) | M (/)| « £ Cond(Tr)Cond 00 (x)(Cond( X )Cond(Tr)) £ g^ i 

« q e (q {K - 1)/2 + K^ 1 ' 2 + , 7 Q / 2 - K / 4 ^)Cond co ( X ) JV Cond(7r) Ar , 

for appropriate $N > 0$. We have used Prop. 6.1 (3) at the last step.

Prop. 6.1 guarantees that $|L^{unr}(1/2, \pi \times \chi)| \ll_{\epsilon} q^{\frac{1}{2} + \epsilon} |\mu(f)|$. From this, optimizing
$\kappa, K$, and applying trivial bounds at ramified places, we obtain the conclusion,
taking $\alpha = 3/26$. □

6.3. Proof of theorem: noncuspidal case. We turn to the proof of (6.2). This
is very similar, but we implement a mild regularization procedure to deal with the
Eisenstein series, just as in the case of Rankin-Selberg L-functions.

Proof. (of (6.2).) We may assume that $\chi$ ramifies at least at one finite place.

Let $\Psi$ be a Schwarz function on $\mathbb{A}_F^2$, $E(g,s) := E_\Psi(g,s)$ the corresponding
Eisenstein series, chosen as in Prop. 6.2 with $s_0 = 1/2$, $s_0' = 0$.

Let $\kappa' > 0$, let h be holomorphic in an open neighbourhood of the vertical
strip $-\kappa' \le \Re(s) \le 1 + \kappa'$ and put $E_h = \int_{\Re(s) = 1+\kappa'} h(s)E(g,s)\,ds$. Then if
$h(0) = h(1/2) = h(1) = 0$, it follows from the third assertion of Prop. 6.2 that

$S_{\infty,d,\beta}(E_h) \ll_{\epsilon,d} \mathrm{Cond}_\infty(\chi)^N \|h\|_N$

for appropriate $N = N(d,\beta) > 0$. Here, as in (10.18) with $\kappa$ replaced by $\kappa'$, the
norm $\|h\|_N$ is defined to be $\int \left(|h(1 + \kappa' + it)| + |h(-\kappa' + it)|\right)(1 + |t|)^N\,dt$.
Put, in the notation of Prop. IO I ( s ) = $( s , 0)£(x, s)L(x, 1 - s). Then: 

(6.16)   $\int_z \mu_z(E_h)\,d^\times z = N(\mathfrak{f})^{-1/2} \int_{\Re(s) = 1/2} h(s)I(s)\,ds.$

This is established in (11.31); for now, we remark that this is "almost"
obvious from Proposition 6.2, the only additional point being that one can replace
E by E, and this is exactly where the fact that $\chi$ is ramified at a finite place comes
in - to kill the constant term of the Eisenstein series.$^{17}$

Take $h = (s - 1/2)^2 s(1 - s) I(1 - s)$. The "good" analytic properties of h, e.g.
rapid decay along vertical lines, follow$^{18}$ from (6.7). In particular, h belongs to the
function spaces $\mathcal{H}(\kappa)$ defined in (10.18) for any $\kappa > 0$, and the norms $\|h\|_N$ are all
bounded by suitable powers of $\mathrm{Cond}_\infty(\chi) \cdot q$.

Then (6.16) becomes

(6.17)   $\int_t t^2(1/4 + t^2)\,|I(1/2 + it)|^2 = q^{1/2} \int_z \mu_z(E_h)\,d^\times z$



^^The classical version of this fact - see - it is clear that the will kill any constant 

term of /, as long as \ is not trivial. 

18 This point was not clear in a previous version; thanks to N. Bergeron for pointing this out. 
To bound the right-hand side, we proceed as in Sec. 6.2 but with f replaced
by $E_h$. We use notation as in that Section, except replacing the "h" defined before
(6.10) by $1 - g_- - g_+$ to avoid clashing with its alternate usage here.

One proves as in that Section, that for d > 1:



(6.18) 

(1-5- ~ 9+)ftz(Eh*<r) 



« e q< (^- 1 /2 + ^-/4 (sups6supp(CT) 11211)1/2) ^ ^(^ 

« f (^ Q -V2 + ^~"/*Kj ||/i|| Cond co ( X ) / 



for some appropriate N > 0. At the last stage we have applied Prop. 6.2 to control
the Sobolev norm.

Prop. 6.2 also guarantees that, for appropriate N > 0, we have:



(6.19) 



z 



9 

9+(—:)Liz(E h *a)d x z 
q K 



<Cond 00 ( X ) 1+e g— +e \\h\\ 



N, 



Combining (6.17) and (6.19), we obtain as in the previous Section the bound,
for sufficiently large N:

(6.20) / t 2 (-+t 2 )\L(-+ i t,x)L(--it, X )\ 2 Ml/2 + l t,0)\ 2 



(q^ +K a - 1 ' 2 + q a ' 2 -^K) \\h\\ N q l ^ConA 00 { X ) N 



One applies the convexity bound to bound $\|h\|_N$, obtaining

t 2 \L{\ + it, X )\L{\ - it, x)| 2 |<f(l/2 + it, 0)| 2 « Cond^^^g 24 / 25 , 

where we have increased N as necessary. From this we get $L(\tfrac{1}{2}, \chi) \ll \mathrm{Cond}_\infty(\chi)^N q^{1/4 - 1/200}$.

□ 

7. Torus periods (II): equidistribution of compact torus orbits. 

It has been independently shown by Zhang, Clozel-Ullmo [7] and P. Cohen
[5] that the subconvexity result Thm. 6.1 implies the equidistribution of Heegner
points over totally real fields; in particular, they pointed out that GRH implies this
equidistribution. Thm. 6.1 makes this result unconditional.

The main aim of this section is to explain how one can obtain certain conditional 
results about equidistribution of subsets of Heegner points, and how this fits into 
the general framework of "sparse equidistribution questions." In particular, this 
approach does not rely on reducing questions about subsets of Heegner points to 
subconvexity, but rather approaches the equidistribution question directly. 

The proofs of the results (and various supporting Lemmas) will only be sketched, 
and we will confine ourselves for simplicity to the case of narrow class number 1; we 
will in any case present an unconditional approach, based on combining the ideas 
of this paper with the ideas of Michel, in the paper (joint with P. Michel). 
We nevertheless feel that the ideas presented here may be of use in other contexts. 
Indeed, this section is of a different flavor to the other Sections; it uses "adelic 
analysis" more genuinely. 
In fact, we shall need a mild refinement of the results of [7], which allows better
control of the dependence on the test vectors. We state this refinement without
proof in Thm. 7.1; the proof is an exercise in explicating some of the proofs in [7].

7.1. Equidistribution of Heegner points. We recall the definition of Heegner 
points. Let F be a totally real number field of degree d over Q. For simplicity we 
shall confine ourselves to the case where the ring of integers of F has narrow class 
number 1. This assumption does not change any of the technical details, which are 
in any case carried out adelically; it simply allows us to be a little more explicit 
about the torus orbits we consider. Let $E = F(\sqrt{-d})$ be a totally imaginary
quadratic extension of F, where $d \in \mathfrak{o}_F$ is totally positive and squarefree. Here
"squarefree" means that it is of valuation $\le 1$ at all finite places.

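Let $T_E$ be the torus $\mathrm{Res}_{E/F}(\mathbb{G}_m)/\mathbb{G}_m$; we embed $T_E$ in $\mathrm{PGL}_2$ via (in obvious
notation):

(7.1)   $x + y\sqrt{-d} \mapsto \begin{pmatrix} x & y \\ -yd & x \end{pmatrix}.$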


Regard d as an element of $F \otimes \mathbb{R}$ via the inclusion $F \hookrightarrow F \otimes \mathbb{R}$. Since it is
totally positive, it possesses a unique totally positive square root, $\sqrt{d} \in F \otimes \mathbb{R}$. Set

[d]oo = (q ) 6 PGL2 ( F R )- We define a map ^ '■ t e(&f)/T e (F) -> X 

via 

(7.2) raji-ftBfcHdJoo, 

where we regard $[d]_\infty \in \mathrm{PGL}_2(F \otimes \mathbb{R}) \subset \mathrm{PGL}_2(\mathbb{A}_F)$ acting by right translation on
X. Denote by N(d) the absolute norm of d, i.e. $N(d) = |\mathfrak{o}_F/d\mathfrak{o}_F|$.

The F-torus $T_E$ is anisotropic, and there is a unique $T_E(\mathbb{A}_F)$-invariant probability
measure on $T_E(\mathbb{A}_F)/T_E(F)$. Let $\nu_E$ be its image under the map defined in (7.2).

Theorem 7.1. Set $E = F(\sqrt{-d})$, where $d \in \mathfrak{o}_F$ is totally positive and squarefree.
The measures $\nu_E$ become equidistributed as $N(d) \to \infty$. Indeed, there exist $\delta > 0$, $d$, $\beta$
such that for $f \in C^\infty(X)$ we have

$\left| \int f\,d\nu_E - \int_X f(x)\,dx \right| \ll N(d)^{-\delta}\, S^*_{d,\beta}(f).$



Recall the definition of $S^*$ from Sec. 2.10. We do not give the proof; as we
have remarked it can be obtained by following the computations of [7] a little more
explicitly.

One recovers from Thm. 7.1 the equidistribution of certain Heegner points
associated to $E = F(\sqrt{-d})$ as d varies. Thm. 7.1 also gives an effective rate of
equidistribution for Heegner points with polynomial dependence on the level and
the eigenvalue of a test function. This rather innocuous polynomial dependence
(in the level aspect, at least) will in fact play a crucial role in our deduction of the
equidistribution of sparse subsets in the following section.

7.2. Equidistribution of subsets of Heegner points. We turn to certain
conditional results on equidistribution of sparse subsets. F being as in Sec. 7.1, let
$E_i = F(\sqrt{-d_i})$ be a sequence of distinct quadratic, totally imaginary, extensions of
F. For each $E_i$, let $S_i \subset T_{E_i}(\mathbb{A}_F)/T_{E_i}(F)$ be a subgroup of finite index $m_i$. Let
$\mu_{E_i}^{S_i}$ be the image of the Haar probability measure on $S_i$ by the map defined in (7.2).
The import of the next theorem is that, if $E_i$ has enough small split primes, one
can obtain the equidistribution of the measures $\mu_{E_i}^{S_i}$, as $i \to \infty$. This result is quite
similar to the results of Duke-Friedlander-Iwaniec [10] in the case $F = \mathbb{Q}$, although
the method is at least superficially rather different. One can also contrast with
Michel's striking result, for $F = \mathbb{Q}$, that gives a comparable result but without the
condition on enough small split primes. Our method is different from that of Michel, who
deduces the result from his subconvexity bound for Rankin-Selberg L-functions.$^{19}$
In the present approach, we prove the equidistribution theorem directly. In a
sequel to this paper, the author and P. Michel combine the methods here with some
methods developed by Michel to make the results of this section unconditional.

To quantify the existence of enough small split primes, one might impose the
condition (as does Linnik [21]) that the $E_i$ vary through a sequence of quadratic
extensions that split at a fixed prime of F. We will prefer to take a more quantitative
approach, which will yield a stronger result at the price of a stronger assumption.
In that regard we introduce the following notation: For $\delta > 0$, we put

$\mathrm{wt}(E_i, \delta) = \#\{\mathfrak{q} \subset \mathfrak{o}_F \text{ prime and split in } E_i,\ N(d_i)^{\delta} < N(\mathfrak{q}) < 2N(d_i)^{\delta}\}.$
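To make the definition concrete, the following is a minimal sketch of how $\mathrm{wt}(E,\delta)$ could be computed in the simplest case $F = \mathbb{Q}$, $E = \mathbb{Q}(\sqrt{-d})$ with $d > 0$ squarefree, so that $N(d) = d$. It is an illustration only and not part of the argument: the function names are hypothetical, and the restriction to $F = \mathbb{Q}$ is an assumption made here. It uses the standard splitting criterion: an odd prime p not dividing d splits exactly when $-d$ is a square mod p, and 2 splits exactly when $d \equiv 7 \pmod 8$.

```python
import math


def is_prime(n):
    """Trial-division primality test; adequate for small illustrative ranges."""
    if n < 2:
        return False
    for k in range(2, math.isqrt(n) + 1):
        if n % k == 0:
            return False
    return True


def splits(p, d):
    """True if the prime p splits in Q(sqrt(-d)), for d > 0 squarefree."""
    if p == 2:
        # 2 splits iff the field discriminant is 1 mod 8, i.e. -d = 1 mod 8.
        return d % 8 == 7
    if d % p == 0:
        return False  # p ramifies
    # Euler's criterion: -d is a square mod p iff (-d)^((p-1)/2) = 1 mod p.
    return pow(-d % p, (p - 1) // 2, p) == 1


def split_primes(d, lo, hi):
    """Primes p with lo < p < hi that split in Q(sqrt(-d))."""
    start = max(2, math.floor(lo) + 1)
    return [p for p in range(start, math.ceil(hi))
            if is_prime(p) and splits(p, d)]


def wt(d, delta):
    """wt(E, delta) for E = Q(sqrt(-d)): split primes in (d^delta, 2 d^delta)."""
    bound = d ** delta
    return len(split_primes(d, bound, 2 * bound))
```

For instance, `split_primes(7, 1, 30)` lists the primes below 30 that are congruent to 1, 2 or 4 mod 7, which are exactly the primes in that range that split in $\mathbb{Q}(\sqrt{-7})$.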

Theorem 7.2. There exists $\delta_1 > 0$ such that, if $\frac{m_i}{\min\left(N(d_i)^{\delta_1(1/2-\alpha)},\ \mathrm{wt}(E_i,\delta_1)^{1/2}\right)} \to 0$,
the sequence $\mu_{E_i}^{S_i}$ converges, as $i \to \infty$, to the invariant measure on X.

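Proof. This is deduced from Thm. 7.1 by using the mixing properties of the
$T_E(\mathbb{A}_F)$-flow. Indeed, we fix an index i and a corresponding field $E_i$. Let $\delta_1 > 0$
be fixed. Let S be the set of prime ideals of F which split in $E_i$ and with norm in
$[N(d_i)^{\delta_1}, 2N(d_i)^{\delta_1}]$. For each $\mathfrak{q} \in S$, the torus $T_{E_i}(F_\mathfrak{q})$ is isomorphic to $F_\mathfrak{q}^\times$. Fix an
isomorphism $\tau_\mathfrak{q} : T_{E_i}(F_\mathfrak{q}) \to F_\mathfrak{q}^\times$, and let $\varpi_\mathfrak{q}$ be an element in $T_{E_i}(F_\mathfrak{q})$ such that
$\tau_\mathfrak{q}(\varpi_\mathfrak{q})$ has valuation $\pm 1$ in $F_\mathfrak{q}^\times$.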

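Let $\chi$ be a character of $T_{E_i}(\mathbb{A}_F)/T_{E_i}(F)$, trivial on $S_i$. Let $\nu_{E_i}$ be as defined
prior to Thm. 7.1 and define

$\mu_{E_i}^{\chi}(f) = \int_{t \in T_{E_i}(\mathbb{A}_F)/T_{E_i}(F)} f(\mathscr{H}(t))\,\chi(t)\,dt,$

where dt is the Haar probability measure on $T_{E_i}(\mathbb{A}_F)/T_{E_i}(F)$. Let $\sigma$ be the
probability measure $\frac{1}{|S|}\sum_{\mathfrak{q} \in S} \chi(\varpi_\mathfrak{q})\,\delta_{\varpi_\mathfrak{q}}$ on $T_{E_i}(\mathbb{A}_F)$. Then $\mu_{E_i}^{\chi}(f) = \mu_{E_i}^{\chi}(f \star \mathscr{H}_*\sigma)$,
where $\mathscr{H}_*\sigma$ denotes the image of $\sigma$ by the map $\mathscr{H}$.
By Cauchy-Schwarz, and Thm. 7.1,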

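(7.3)   $|\mu_{E_i}^{\chi}(f \star \mathscr{H}_*\sigma)|^2 \le \nu_{E_i}(|f \star \mathscr{H}_*\sigma|^2)$

        $\le \|f \star \mathscr{H}_*\sigma\|_{L^2}^2 + O\left(N(d_i)^{-\delta}\, S^*_{d,\beta}(|f \star \mathscr{H}_*\sigma|^2)\right),$

where $\delta$, $d$, $\beta$ are as in Thm. 7.1. Now, appropriate variants of Lem. 8.1 and 8.2
(for $S^*$ instead of $S$) show that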

(7-4) S^df + JKal 2 ) « S^^U-kJZ&f 

« sup iMr^ di/3 (/) 2 « m^s^f) 2 



$^{19}$ We note that this bound of Michel is considerably deeper than 5.3, since it deals with
varying central character. For some speculative discussion on the "reason" that Michel's method
can avoid this condition, see the last paragraph of [24].
and bounds towards Ramanujan show that

(7.5)   $\|f \star \mathscr{H}_*\sigma\|_{L^2}^2 \ll \left(N(d_i)^{\delta_1(2\alpha-1)} + |S|^{-1}\right)\|f\|_{L^2}^2.$

We note that (7.4) and (7.5) are very closely analogous to (6.13) and (6.14), with
K replaced by $N(d_i)^{\delta_1}$. In the context of (6.14), the set S has size $K^{1-\epsilon}$; thus the
term $|S|^{-1}$ that appears in (7.5) could be neglected.

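Recalling the definition of $\mu_{E_i}^{\chi}$, we conclude

(7.6)   $\left| \int_{t \in T_{E_i}(\mathbb{A}_F)/T_{E_i}(F)} f(\mathscr{H}(t))\,\chi(t)\,dt \right| \ll \left(N(d_i)^{3\delta_1\beta - \delta/2} + N(d_i)^{\delta_1(\alpha - 1/2)} + |S|^{-1/2}\right) S^*_{d,\beta}(f).$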

Summing the left-hand side of (7.6) over all $m_i$ characters $\chi$ of $T_{E_i}(\mathbb{A}_F)/T_{E_i}(F)$
that are trivial on $S_i$, and substituting $|S| \sim \mathrm{wt}(E_i, \delta_1)$, we obtain:

$|\mu_{E_i}^{S_i}(f)| \ll m_i \left(N(d_i)^{3\delta_1\beta - \delta/2} + N(d_i)^{\delta_1(\alpha - 1/2)} + \mathrm{wt}(E_i, \delta_1)^{-1/2}\right) S^*_{d,\beta}(f).$

Choosing $\delta_1$ sufficiently small (the exact value will depend on the value of $\beta$, $\delta$
from Thm. 7.1) we obtain the claimed conclusion. □



8. Background on Sobolev norms and reduction theory. 

The rest of the paper consists of technical Lemmas. The sections that follow are 
arranged to be used as a reference, rather than to be read through. 

8.1. Formal properties of the Sobolev norms. We begin by explicating certain 
formal properties of the Sobolev norms defined in Sec. 2.9.3.

Remark 8.1. The following properties of this definition are formal and will be 
repeatedly used: 

(1) Translations by $K_{\max,G}$ preserve $S_{p,d,\beta}$, i.e. $S_{p,d,\beta}(k \cdot f) = S_{p,d,\beta}(f)$ for
$k \in K_{\max,G}$.

(2) If $L : C^\infty_\omega(X_G) \to \mathbb{C}$ is a linear functional and $|L(\psi)| \le PS_{p,d,\beta}(\psi)$,
then also $|L(\psi)| \le S_{p,d,\beta}(\psi)$. Indeed $\psi \mapsto |L(\psi)|$ is itself a seminorm
on $C^\infty_\omega(X_G)$.

(3) Suppose that $E : C^\infty_\omega(X_G) \to C^\infty_\omega(X_G)$ is a linear endomorphism
satisfying $PS_{p,d,\beta}(Ef) \le A \cdot PS_{p,d,\beta}(f)$, for some $A \in \mathbb{R}$. Then also
$S_{p,d,\beta}(Ef) \le A\,S_{p,d,\beta}(f)$. Indeed, $f \mapsto A^{-1}S_{p,d,\beta}(Ef)$ is a seminorm dominated by
$PS_{p,d,\beta}$.

(4) We shall need a slight variant of (3) in the case where we are studying only
the space of f with some invariance property.

Suppose that $E : C^\infty_\omega(X_G) \to C^\infty_\omega(X_G)$ is a linear endomorphism, M
is a finite set of finite places, and for each $v \in M$ we are given an open
compact $K_{1,v} \subset K_v$. Suppose moreover that $PS_{p,d,\beta}(Ef) \le A \cdot PS_{p,d,\beta}(f)$
for some $A \in \mathbb{R}$ and for all f which are $\prod_{v \in M} K_{1,v}$-fixed. Then, for all f
which are $\prod_{v \in M} K_{1,v}$-fixed, we have in fact

$S_{p,d,\beta}(Ef) \le A \prod_{v \in M} [K_v : K_{1,v}]^{\beta}\, S_{p,d,\beta}(f).$

Indeed, put $K_{1,M} = \prod_{v \in M} K_{1,v}$ and let $\Pi$ be the averaging operator
$\int_{k \in K_{1,M}} \pi(k)\,dk$, where $K_{1,M}$ is endowed with the Haar probability
measure. Then apply (3) above to the operator $f \mapsto E(\Pi f)$.

Lemma 8.1. Let $F_1 \in C^\infty_{\omega_1}(X_G)$, $F_2 \in C^\infty_{\omega_2}(X_G)$. Then

$S_{p,d,\beta}(F_1F_2) \ll_d S_{2p,d,\beta}(F_1)\,S_{2p,d,\beta}(F_2).$

Note that $F_1F_2 \in C^\infty_{\omega_1\omega_2}(X_G)$.

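Proof. Put $F = F_1F_2$. For any monomial $\mathcal{D}$ of degree d in $\mathcal{B}$, we can write
$\mathcal{D}(F_1F_2) = \sum_{\alpha \in I} (\mathcal{D}_{\alpha,1}F_1)(\mathcal{D}_{\alpha,2}F_2)$, where $\alpha$ ranges over an index set I whose size
is bounded by a constant depending only on d, and the $\mathcal{D}_{\alpha,k}$ are certain monomials
in $\mathcal{B}$ satisfying $\mathrm{ord}(\mathcal{D}_{\alpha,1}) + \mathrm{ord}(\mathcal{D}_{\alpha,2}) = d$.

Applying Cauchy-Schwarz, we conclude

(8.1)   $\|\mathcal{D}F\|_{L^p(X_{G,\mathrm{ad}})} \le \sum_{\alpha} \|\mathcal{D}_{\alpha,1}F_1\|_{L^{2p}(X_{G,\mathrm{ad}})}\,\|\mathcal{D}_{\alpha,2}F_2\|_{L^{2p}(X_{G,\mathrm{ad}})}.$

Clearly, for each finite place v, we have $K_{v,F} \supset K_{v,F_1} \cap K_{v,F_2}$; in particular
$[K_{\max,G} : K_F] \le [K_{\max,G} : K_{F_1}][K_{\max,G} : K_{F_2}]$. It follows that

(8.2)   $[K_{\max,G} : K_F]^{\beta} \sum_{\mathcal{D}} \|\mathcal{D}F\|_{L^p(X_{G,\mathrm{ad}})} \ll \Big([K_{\max,G} : K_{F_1}]^{\beta} \sum_{\mathcal{D}} \|\mathcal{D}F_1\|_{L^{2p}}\Big)\Big([K_{\max,G} : K_{F_2}]^{\beta} \sum_{\mathcal{D}} \|\mathcal{D}F_2\|_{L^{2p}}\Big),$

where the implicit constant depends only on d, and in all three instances $\mathcal{D}$ varies
over the set of monomials in $\mathcal{B}$ of degree $\le d$.

That is to say, there is a constant $C = C(d)$ such that

$PS_{p,d,\beta}(F_1F_2) \le C \cdot PS_{2p,d,\beta}(F_1)\,PS_{2p,d,\beta}(F_2).$

From (2.11) we deduce

$S_{p,d,\beta}(F_1F_2) \le C \cdot S_{2p,d,\beta}(F_1)\,S_{2p,d,\beta}(F_2),$

as required. □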
We recall the definition of $\|g\|$ for $g \in G(F_\infty)$, $G(\mathbb{A}_F)$ etc. from Sec. 2.4.

Lemma 8.2. Let $F \in C^\infty(X_G)$ and $g = (g_\infty, g_f) \in G(\mathbb{A}_F)$. Then

$S_{p,d,\beta}(g \cdot F) \le \|g_\infty\|^{d}\,\|g_f\|^{\beta}\, S_{p,d,\beta}(F).$

Proof. Put $F' = (g_\infty, g_f) \cdot F$, where $g_f = (g_v)_{v\ \mathrm{finite}}$. For each finite place v, we note
that $K_{v,F'} \supset g_vK_{v,F}g_v^{-1} \cap K_{v,G}$. The index $[K_{v,G} : K_{v,F'}]$ is therefore bounded
above by the number of cosets $xg_vK_{v,F}$ in $K_{v,G}g_vK_{v,F}$. Clearly this is bounded
above by the number of left $K_{v,F}$ cosets in $K_{v,G}g_vK_{v,G}$; but the number of such
cosets is precisely $|K_{v,G}g_vK_{v,G}/K_{v,G}| \cdot [K_{v,G} : K_{v,F}]$. It now follows easily from the definitions that
$PS_{p,d,\beta}(F') \le \|g_\infty\|^{d}\|g_f\|^{\beta}PS_{p,d,\beta}(F)$. Applying Rem. 8.1 to the endomorphism
$F \mapsto (g_\infty, g_f) \cdot F$, we obtain the claim. □
The following crude Lemma is as much of interpolation as we need. It will be
applied, in practice, where E is a composite of a Hecke operator and a certain
$L^2$-projection.

Lemma 8.3. Let E be a linear endomorphism of $C^\infty_\omega(X_G)$ which commutes with
$G(F_\infty) \times K_{\max,G}$. Suppose there are real numbers A, B > 0 such that for any $f \in
C^\infty_\omega(X_G)$, we have $\|Ef\|_{L^2} \le A\|f\|_{L^2}$, $\|Ef\|_{L^\infty} \le B\|f\|_{L^\infty}$. Then for $2 \le p \le \infty$:

$S_{p,d,\beta}(Ev) \le A^{2/p}B^{1-2/p}\,S_{p,d,\beta}(v).$

(We admit also $B = \infty$, in which case the $L^\infty$ hypothesis should be seen as void,
and the result becomes $S_{2,d,\beta}(Ev) \le A\,S_{2,d,\beta}(v)$.)

Proof. By interpolation, the operator norm of E w.r.t. the $L^p$ norm on $C^\infty_\omega(X_G)$
is $\le A^{2/p}B^{1-2/p}$. Moreover, the assumption on E shows that $K_{Ef} \supset K_f$.
It follows that for $f \in C^\infty_\omega(X_G)$ we have the inequality

$PS_{p,d,\beta}(Ef) \le A^{2/p}B^{1-2/p}\,PS_{p,d,\beta}(f).$

Rem. 8.1 implies the conclusion. □

8.1.1. Computing Sobolev norms in the Kirillov model. In the present section, let
v be an archimedean place of F.

Let $\pi_v$ be a generic unitary irreducible representation of $\mathrm{GL}_2(F_v)$. Recall that
this means that $\pi_v$ is realized in a space $\mathscr{K}$ of functions (the Kirillov model,
consisting of restrictions of functions in the Whittaker model to the diagonal torus) on
$F_v^\times$. Recall also the definition of the local conductor $\mathrm{Cond}_v(\pi_v)$ from Sec. 2.12.2.

In this model, the diagonal torus acts by translation and upper triangular
matrices act through multiplication by characters: that is to say, for $f \in \mathscr{K}$, $y_1, y_2 \in F_v^\times$,
$z \in F_v$ we have the rules

(8.3)   $\pi(a(y_1))f : y_2 \mapsto f(y_1y_2), \qquad \pi(n(z))f : y_2 \mapsto f(y_2)\,e_{F_v}(zy_2).$

From these facts it is easy to verify that the space of smooth vectors in $\pi_v$ contains
all compactly supported smooth functions on $F_v^\times$. Moreover,

(8.4)   $\|f\|_2^2 = \int_{F_v^\times} |f(y)|^2\,d^\times y$

defines a $\mathrm{GL}_2(F_v)$-invariant inner product on $\mathscr{K}$.

We will eventually have occasion to choose test vectors in $\pi_v$ in this model, and
wish to evaluate the "Sobolev norms" of the resulting vectors.

Lemma 8.4. Suppose $F_v = \mathbb{R}$. Let $f \in \mathscr{K}$ be $C^\infty$ and compactly supported. Then

1/2 



£ ||X>/||a«Cand*(7iy, 

ord(V)<k 




-l\2k 





d x y\ 


d j y 





where the $\mathcal{D}$ sum ranges over all monomials in a fixed basis for $\mathrm{Lie}(\mathrm{GL}_2(F_v))$ of
degree $\le k$.

Suppose $F_v = \mathbb{C}$, and suppose $f \in \mathscr{K}$ is $C^\infty$ and compactly supported. Then

i+3 2 \ V2 

i CW.Lu,..* [ V / ( :| |.: V ' rrlJ 

ord(X>)<fc 




d l zdiz 
Proof. We prove only the case with $F_v = \mathbb{R}$, the complex case being similar.

Let h, e, f, z be nonzero elements of the (real) Lie algebra of $\mathrm{GL}_2(\mathbb{R})$, defined
via

$h = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad e = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad f = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}, \quad z = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}.$

These satisfy the usual commutation relations $[h, e] = 2e$, $[h, f] = -2f$, $[e, f] = h$.
Let $\lambda$ be the scalar by which the Casimir operator $\tfrac{1}{2}h^2 + ef + fe$ acts, and $\nu$ the
scalar by which z acts; then $1 + |\lambda| + |\nu|^2 \ll \mathrm{Cond}_v(\pi_v)^2$.
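As a sanity check on the sign conventions, the standard relations $[h,e] = 2e$, $[h,f] = -2f$, $[e,f] = h$ and the identity $\tfrac{1}{2}h^2 + ef + fe = \tfrac{1}{2}h^2 + 2ef - h$ can be verified directly on the $2 \times 2$ matrices. The sketch below does this in exact arithmetic; the helper names are hypothetical, and it only tests the defining two-dimensional representation, not the Kirillov model.

```python
from fractions import Fraction


def mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def add(a, b):
    """Entrywise sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def scale(c, a):
    """Scalar multiple c * a of a 2x2 matrix."""
    return [[c * a[i][j] for j in range(2)] for i in range(2)]


def bracket(a, b):
    """Commutator [a, b] = ab - ba."""
    return add(mul(a, b), scale(-1, mul(b, a)))


h = [[1, 0], [0, -1]]
e = [[0, 1], [0, 0]]
f = [[0, 0], [1, 0]]

# Casimir element (1/2) h^2 + ef + fe in the defining representation.
casimir = add(scale(Fraction(1, 2), mul(h, h)), add(mul(e, f), mul(f, e)))
# Equivalent form (1/2) h^2 + 2ef - h, which follows from fe = ef - h.
casimir_alt = add(scale(Fraction(1, 2), mul(h, h)),
                  add(scale(2, mul(e, f)), scale(-1, h)))
```

In this representation the Casimir element is the scalar matrix $\tfrac{3}{2}\cdot\mathrm{Id}$, consistent with its acting by a scalar on an irreducible representation.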

It is easy to see how h, e act on $\mathscr{K}$: h acts by the differential
operator $c_1\nu + c_2\,y\frac{d}{dy}$ and e acts by multiplication by $c_3y$, for some constants $c_1, c_2, c_3$.
The Casimir operator $\tfrac{1}{2}h^2 + ef + fe = \tfrac{1}{2}h^2 + 2ef - h$ acts by the scalar $\lambda$; so it
follows that for $v \in \mathscr{K}$ we have $efv = \tfrac{1}{2}(\lambda + h - \tfrac{1}{2}h^2)v$. In particular, f acts on any
compactly supported function via the differential operator $c_1'y^{-1} + c_2'\frac{d}{dy} + c_3'\,y\frac{d^2}{dy^2}$,
for certain constants $c_1', c_2', c_3'$ satisfying $|c_1'|, |c_2'|, |c_3'| \ll \mathrm{Cond}_v(\pi_v)^2$. (In fact,
IcJI < Cond^Tr,,) 3 - 2 .) 

Any monomial of degree k in h, e, f, z is therefore a sum of terms $c_{\gamma,\delta}\,y^{\gamma}\frac{d^{\delta}}{dy^{\delta}}$, where
$|c_{\gamma,\delta}| \ll \mathrm{Cond}_v(\pi_v)^{2k}$, $|\gamma| \le k$, $\delta \le 2k$. The claimed result follows in the case $F_v = \mathbb{R}$.

A similar proof holds for $F_v = \mathbb{C}$. □



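8.2. Reduction theory. Recall that $F_\infty := F \otimes_{\mathbb{Q}} \mathbb{R}$. Let $K_\infty$, $K_v$, $K_{\max}$ be as in
Sec. 2.5. Then $K_\infty \times K_{\max}$ is a maximal compact subgroup of $\mathrm{GL}_2(\mathbb{A}_F)$. Given
$g \in \mathrm{GL}_2(\mathbb{A}_F)$ we may always write $g = \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}\begin{pmatrix} x & 0 \\ 0 & y \end{pmatrix}k$, with $t \in \mathbb{A}_F$, $x, y \in
\mathbb{A}_F^\times$, $k \in K_\infty \times K_{\max}$. We set $\mathrm{ht}(g) = |xy^{-1}|_{\mathbb{A}}$; this is well-defined, although x, y are
not unique.

Then ht descends to a function $B(F)\backslash\mathrm{GL}_2(\mathbb{A}_F) \to \mathbb{R}_{>0}$. Explicitly,

(8.5)   $\mathrm{ht}\begin{pmatrix} a & b \\ c & d \end{pmatrix} = \frac{|ad - bc|_{\mathbb{A}}}{\prod_v \|(c_v, d_v)\|_v^2},$

where one defines $\|(c_v, d_v)\|_v = \max(|c_v|_v, |d_v|_v)$ for v finite, and

(8.6)   $\|(c_v, d_v)\|_v = \left(|c_v|_v^{2/\deg(v)} + |d_v|_v^{2/\deg(v)}\right)^{\deg(v)/2}$

for v infinite, where $\deg(v) = [F_v : \mathbb{R}]$.

Define $\mathfrak{S}(T) \subset B(F)\backslash\mathrm{GL}_2(\mathbb{A}_F)$ to be $\mathfrak{S}(T) := \{g : \mathrm{ht}(g) \ge T\}$. Then, for all
$T > 0$ the natural projection $\Pi : \mathfrak{S}(T) \to \mathrm{GL}_2(F)\backslash\mathrm{GL}_2(\mathbb{A}_F)$ has finite fibers; for
sufficiently large T, it is injective, and for sufficiently small T it is surjective. This
is the content of reduction theory for $\mathrm{GL}_2$. As a consequence, the complement of
$\Pi(\mathfrak{S}(T))$ has compact closure, modulo the center, for each T.

Fix $T_0$ such that $\Pi : \mathfrak{S}(T_0) \to \mathrm{GL}_2(F)\backslash\mathrm{GL}_2(\mathbb{A}_F)$ is injective. Then we define a
function $\mathrm{ht} : \mathrm{GL}_2(F)\backslash\mathrm{GL}_2(\mathbb{A}_F) \to \mathbb{R}$ via the rule

$\mathrm{ht}(g) = \begin{cases} \mathrm{ht}(g'), & \text{if } g = \Pi(g') \text{ for some } g' \in \mathfrak{S}(T_0), \\ T_0, & \text{else.} \end{cases}$

In fact, it is clear that ht descends to a function $X = \mathrm{PGL}_2(F)\backslash\mathrm{PGL}_2(\mathbb{A}_F) \to \mathbb{R}_{\ge T_0}$.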
Lemma 8.5. Let $U \subset \mathrm{GL}_2(F_\infty)$ be compact and $x \in X_{\mathrm{GL}(2)}$. The fibers of the
map $U \times K_{\max} \to X_{\mathrm{GL}(2)}$ defined by $(u, k) \mapsto xuk$ have size bounded by $O(\mathrm{ht}(x))$,
where the implicit constant depends on U.

Proof. Suppose $g \in \mathrm{GL}_2(\mathbb{A}_F)$ is a lift of $x \in X_{\mathrm{GL}(2)}$. Consider the map $U \times K_{\max} \to
X_{\mathrm{GL}(2)}$ given by $(u, k) \mapsto guk$, as above. Let $(u, k)$ belong to a fiber of maximal
size. Call this size M. Then

(8.7)   $M = \#\{\gamma \in \mathrm{GL}_2(F) : guk = \gamma gu'k',\ \exists\, u' \in U,\ k' \in K_{\max}\}$

        $\le \#\{\gamma : gu''k'' = \gamma g,\ \exists\, u'' \in U \cdot U^{-1},\ k'' \in K_{\max}\}.$

Set $V = U \cdot U^{-1}$, a compact subset of $\mathrm{GL}_2(F_\infty)$. The definition of $\mathfrak{S}(T)$ shows
that there exists a constant $c < 1$, depending on V, such that $\mathfrak{S}(T) \cdot V \cdot K_{\max} \subset
\mathfrak{S}(cT)$. Choose T so large that the projection $\mathfrak{S}(cT) \to X_{\mathrm{GL}(2)}$ is injective. It will
suffice to show, for each $g \in \mathfrak{S}(T)$, that

(8.8)   $\#\{\gamma \in \mathrm{GL}_2(F) : \gamma g \in gVK_{\max}\} \ll \mathrm{ht}(g).$

Both g and $gVK_{\max}$ belong entirely to $\mathfrak{S}(cT)$. By the choice of T, $\gamma g \in gVK_{\max}$
implies $\gamma \in B(F)$. Write $\gamma = a_\gamma n_\gamma$, with $a_\gamma \in A(F)$ and $n_\gamma \in N(F)$; also,
write $g = n_ga_gk_g$ with $n_g \in N(\mathbb{A}_F)$, $a_g \in A(\mathbb{A}_F)$, $k_g \in K_\infty \times K_{\max}$. We are free
to adjust g on the left by an element of $N(F)$, since doing so will not affect the
cardinality of the set $\{\gamma \in \mathrm{GL}_2(F) : \gamma g \in gVK_{\max}\}$. We may thereby assume that
$n_g$ lies in a fixed compact subset of $N(\mathbb{A}_F)$. Thus we can write $g = a_gk_g'$, where
$k_g' := a_g^{-1}n_ga_gk_g$ lies in a certain fixed compact subset $\Omega$ of $\mathrm{GL}_2(\mathbb{A}_F)$.

Now, $\gamma g \in gVK_{\max}$ implies that $a_g^{-1}a_\gamma n_\gamma a_g \in \Omega VK_{\max}\Omega^{-1}$. Noting that $a_g^{-1}a_\gamma n_\gamma a_g =
a_\gamma \cdot a_g^{-1}n_\gamma a_g$, we deduce that $a_\gamma$ lies in a fixed compact subset of $\mathrm{GL}_2(\mathbb{A}_F)$,
depending only on U; thus the number of possibilities for $a_\gamma$ is $\ll_U 1$. Moreover, it now
follows that $a_g^{-1}n_\gamma a_g$ lies in a compact subset of $N(\mathbb{A}_F)$ depending only on U.

Thus, if we write $a_g = \begin{pmatrix} x & 0 \\ 0 & y \end{pmatrix}$, $n_\gamma = \begin{pmatrix} 1 & \beta \\ 0 & 1 \end{pmatrix}$, then $\beta \in xy^{-1}\Omega'$, where
$\Omega' \subset \mathbb{A}_F$ is a compact subset that depends only on U. It is easy to see that the
number of possibilities for $\beta$ is $\ll_U 1 + |xy^{-1}|_{\mathbb{A}_F}$. But $|xy^{-1}|_{\mathbb{A}_F} = \mathrm{ht}(g)$, which is a
function that is bounded away from zero, and we are done. □

Lemma 8.6. Let notations be as in the previous Lemma 8.5. Consider the
composite map $U \times K_{\max} \to X_{\mathrm{GL}(2)} \to X$. Each fiber of this map may be written as the
union of at most $O(\mathrm{ht}(x))$ sets each of the form $yZ(\mathbb{A}_F) \cap UK_{\max}$, where Z is the
center of $\mathrm{GL}_2$ and $y \in \mathrm{GL}_2(\mathbb{A}_F)$.

Proof. Let $\bar{x}$ be the image of x in X. Let $u, u' \in U$, $k, k' \in K_{\max}$. Suppose that
$\bar{x}uk = \bar{x}u'k'$ in X. Then there is $z \in \mathbb{A}_F^\times$ and $\gamma \in \mathrm{GL}_2(F)$ such that

(8.9)   $xuk = \gamma xu'k'a(z, z)$, equality in $\mathrm{GL}_2(\mathbb{A}_F)$.

For fixed u, k and $\gamma$, the set of $u'k'$ satisfying (8.9) is visibly the intersection of
$UK_{\max}$ with a fixed $Z(\mathbb{A}_F)$-coset. This coset depends only on the class of $\gamma$ in
$\mathrm{PGL}_2(F)$, so it suffices to show that those $\gamma \in \mathrm{GL}_2(F)$ that occur in equalities such
as (8.9) for varying $u, k, u', k'$ represent at most $O(\mathrm{ht}(x))$ distinct cosets $\gamma Z(F)$ in
$\mathrm{PGL}_2(F)$.

Taking determinant followed by the norm $\mathbb{A}_F^\times/F^\times \to \mathbb{R}^\times$, we conclude that $|z|_{\mathbb{A}}$
belongs to a compact subset of $\mathbb{R}^\times$ that depends only on U. The norm map
$\mathbb{A}_F^\times/F^\times \to \mathbb{R}^\times$ being proper, it follows that z itself belongs to a compact subset of
$\mathbb{A}_F^\times/F^\times$ that depends only on U.

In particular, there is a compact subset $\Omega \subset F_\infty^\times$, depending only on U, and
a finite subset $P \subset \mathbb{A}_F^\times$, containing 1 and also depending only on U, such that
$z \in F^\times \cdot \Omega \cdot P \cdot \prod_{v\ \mathrm{finite}} \mathfrak{o}_{F,v}^\times$. Let $U' = U \cdot \{a(z_\infty, z_\infty) : z_\infty \in \Omega\}$. Given a solution to
(8.9), write $z = \delta z_\infty p\,o$, with $\delta \in F^\times$, $z_\infty \in \Omega$, $p \in P$, $o \in \prod_{v\ \mathrm{finite}} \mathfrak{o}_{F,v}^\times$. Then

$xuk = \gamma a(\delta, \delta)\,x\,a(p, p)\,u'a(z_\infty, z_\infty)\,k'a(o, o);$

in particular, taking $u'' = u'a(z_\infty, z_\infty) \in U'$, $k'' = k'a(o, o) \in K_{\max}$, the image of
$xa(p, p)u''k''$ in $X_{\mathrm{GL}(2)}$ coincides with $xuk$. So the number of possibilities for the $Z(F)$-coset
of $\gamma$ is bounded above by the size of the fibers of the map $P \times U' \times K_{\max} \to X_{\mathrm{GL}(2)}$ given
by $(p, u, k) \mapsto xa(p, p)uk$. The result follows from Lem. 8.5.

□

We shall now need a quantitative version of certain statements in reduction
theory. The subsequent Lemma is a fancier version of the following statement: the
number of $\gamma \in \mathrm{SL}(2, \mathbb{Z})$ that map a fixed $z \in \mathbb{H}$ into the Siegel set

$\{x + iy : 0 \le x \le 1,\ y \ge T\}$

is $\ll 1 + T^{-1}$.
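The classical statement can be explored numerically. The sketch below is only an illustration under stated assumptions (the function name is hypothetical, and matrices are counted up to sign): for $\gamma$ with bottom row $(c, d)$ one has $\mathrm{Im}(\gamma z) = y/((cx+d)^2 + (cy)^2)$, and each coprime pair $(c, d)$, taken up to sign, corresponds to exactly one $\gamma$ (up to sign) moving $z = x + iy$ into the strip $0 \le \mathrm{Re} < 1$. So the count reduces to counting coprime pairs with $(cx+d)^2 + (cy)^2 \le y/T$.

```python
import math


def count_into_siegel(x, y, T):
    """Count coprime bottom rows (c, d), taken up to sign, such that
    Im(gamma z) = y / ((c*x + d)**2 + (c*y)**2) >= T for z = x + iy."""
    if y <= 0 or T <= 0:
        raise ValueError("need y > 0 and T > 0")
    limit = y / T  # the condition reads (c*x + d)^2 + (c*y)^2 <= y / T
    count = 0
    # c = 0 forces d = +-1, which is a single class up to sign.
    if 1.0 <= limit:
        count += 1
    # For c >= 1 we need (c*y)^2 <= limit, i.e. c <= sqrt(limit) / y.
    c_max = math.floor(math.sqrt(limit) / y)
    for c in range(1, c_max + 1):
        rest = limit - (c * y) ** 2
        if rest < 0:
            continue
        r = math.sqrt(rest)
        for d in range(math.ceil(-c * x - r), math.floor(-c * x + r) + 1):
            # Re-check the inequality to guard against rounding in r.
            if math.gcd(c, d) == 1 and (c * x + d) ** 2 + (c * y) ** 2 <= limit:
                count += 1
    return count
```

For $z = i$ and $T = 0.3$ the admissible bottom rows are $(0,1)$, $(1,0)$, $(1,1)$ and $(1,-1)$; for $T > 1$ there are none, in line with a bound of the shape $1 + T^{-1}$.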

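Lemma 8.7. Let $g \in \mathrm{GL}_2(\mathbb{A}_F)$ and $Y > 0$ a positive real number. Then

(8.10)   $\#\{\gamma \in B(F)\backslash\mathrm{GL}_2(F) : \mathrm{ht}(\gamma g) \ge Y\} \ll_\epsilon 1 + Y^{-1-\epsilon}.$

Here the implicit constant is independent of g. Moreover, suppose $g \in \mathfrak{S}(T)$ with
$T \ge 1$. Then:

(8.11)   $\sup\{\mathrm{ht}(\gamma g) : \gamma \notin B(F)\} \le T^{-1}.$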

Proof. The proof of (8.10) is not difficult, generalizing in a straightforward way the
proof with $F = \mathbb{Q}$. However, it is somewhat notationally tedious; the (hypothetical)
reader may wish to simply work out the proof for $F = \mathbb{Q}$, where it is equivalent to
the following fact: the number of primitive vectors in a unimodular sublattice of $\mathbb{R}^2$
that are contained in an R-ball is $\ll (1 + R^2)$, uniformly in the lattice. (The result
can also be deduced if one admits some basic facts from the theory of Eisenstein series
over F, but we wish to rather deduce these basic facts from the present Lemma.)
We also remark that the entire content of (8.10) lies in the uniformity in g.

Without loss of generality, we take $g \in \mathfrak{S}(T_0)$, where $T_0$ is sufficiently small
that the map $\mathfrak{S}(T_0) \to X_{\mathrm{GL}(2)}$ is surjective. So $g = \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}\begin{pmatrix} x & 0 \\ 0 & y \end{pmatrix}k$ with
$|xy^{-1}|_{\mathbb{A}} \ge T_0$. Moreover, replacing g by gz, for any $z \in Z(\mathbb{A}_F)$, does not affect the
problem, so we may take $y = 1$. Then, for $\gamma = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in \mathrm{GL}_2(F)$, we have

(8.12)   $\mathrm{ht}(\gamma g) = \frac{|x|_{\mathbb{A}}}{\prod_v \|(x_vc,\ ct_v + d)\|_v^2}.$

The equivalence class of $\gamma$ in $B(F)\backslash\mathrm{GL}_2(F)$ depends only on the pair $(c, d) \in F^2$,
considered up to $F^\times$ equivalence (i.e., it depends only on $c/d \in F \cup \{\infty\}$). It
suffices, then, to estimate the number

(8.13)   $\#\{[c : d] \in \mathbb{P}^1(F) : \prod_v \|(x_vc,\ ct_v + d)\|_v^2 \le Y^{-1}|x|_{\mathbb{A}}\}.$
(8.13) #{[c : d] E ¥ 1 {F),Y[\\x v c,ct v + d\\ 2 v < Y' 1 ^}, 
If $\Omega$ is any fixed compact subset of $\mathbb{A}_F^\times$, then for $\omega \in \Omega$ we have $\prod_v \|(x_v\omega c,\ ct_v +
d)\|_v \asymp \prod_v \|(x_vc,\ ct_v + d)\|_v$. Consider $\mathbb{R}_{>0}$ as embedded in $\mathbb{A}_F^\times$ via $\mathbb{R}_{>0} \hookrightarrow \mathbb{A}_{\mathbb{Q}}^\times \hookrightarrow \mathbb{A}_F^\times$.
Then there is a compact subset $\Omega \subset \mathbb{A}_F^\times$, such that $\mathbb{A}_F^\times = F^\times \cdot \Omega \cdot \mathbb{R}_{>0}$.

The size of (8.13) is unaffected by the substitution $(x, t) \mapsto (x\tau, t\tau)$, for any
$\tau \in F^\times$. In view of the above remarks we may assume - decreasing Y by a
constant that depends only on F - that $x \in \mathbb{R}_{>0}$. Moreover, the size of (8.13) is
also unaffected by the substitution $t \mapsto t + \tau$, for $\tau \in F$. We may therefore assume
that $|t|_v \le 1$ for all finite places v.

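Fix a set of representatives $\mathfrak{J}_1, \ldots, \mathfrak{J}_h$ for the class group of $\mathfrak{o}_F$; we will assume
each $\mathfrak{J}_j$ is integral. For any $[c : d] \in \mathbb{P}^1(F)$, we may find a representative $(c, d)$ so
that the ideal $c\mathfrak{o}_F + d\mathfrak{o}_F$ is one of the $\mathfrak{J}_i$; moreover, replacing $(c, d) \in \mathfrak{J}_i^2$ by $(\epsilon c, \epsilon d)$
for $\epsilon \in \mathfrak{o}_F^\times$ does not change the class $[c : d]$.

The restrictions on x, t imply that $\|(x_vc,\ ct_v + d)\|_v = \|(c, d)\|_v$ for all finite v.
Then $\prod_{v\ \mathrm{finite}} \|(c, d)\|_v = N(\mathfrak{J}_i)^{-1}$, the inverse of the norm of $\mathfrak{J}_i = c\mathfrak{o}_F + d\mathfrak{o}_F$. So it
will suffice to bound, for each $1 \le i \le h$, the quantity

$\#\{(c, d) \in \mathfrak{J}_i^2/\mathfrak{o}_F^\times : c\mathfrak{o}_F + d\mathfrak{o}_F = \mathfrak{J}_i,\ \prod_{v | \infty} (|x_v|^2|c|_v^2 + |ct_v + d|_v^2) \le Y^{-1}|x|_{\mathbb{A}}\,N(\mathfrak{J}_i)^2\}.$

Since $\mathfrak{J}_i$ belongs to a finite set, the quantity $N(\mathfrak{J}_i)$ is bounded; thus, decreasing
Y again as necessary, it suffices to estimate for each $1 \le i \le h$

$\#\{(c, d) \in \mathfrak{J}_i^2/\mathfrak{o}_F^\times : c\mathfrak{o}_F + d\mathfrak{o}_F = \mathfrak{J}_i,\ \prod_{v | \infty} \|(x_vc,\ ct_v + d)\|_v^2 \le Y^{-1}|x|_{\mathbb{A}}\}.$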

There is only one term corresponding to $c = 0$. Otherwise, $(c)$ is a principal
ideal divisible by $\mathfrak{J}_i$; let $\mathcal{P}$ be the set of integral principal ideals. Then the size of
the set above is precisely

(8.14)   $\sum_{(c) \in \mathcal{P}} \#\{d \in \mathfrak{J}_i : (c) + d\mathfrak{o}_F = \mathfrak{J}_i,\ \prod_{v | \infty} \|(x_vc,\ ct_v + d)\|_v^2 \le Y^{-1}|x|_{\mathbb{A}}\}.$

We note that the size of the inner set is independent of the choice of generator for
the principal ideal $(c)$. Moreover, the inequality of (8.14) implies that the norm
$N((c))$ of the principal ideal $(c)$ satisfies $N((c))^2 \le Y^{-1}|x|_{\mathbb{A}}^{-1}$.

Let us estimate the number of d that can correspond to a fixed principal ideal
$(c)$ in (8.14). Recall that $|x|_{\mathbb{A}} \ge T_0$ and that x is in the image of the embedding
$\mathbb{R}_{>0} \hookrightarrow \mathbb{A}_{\mathbb{Q}}^\times \hookrightarrow \mathbb{A}_F^\times$. In particular, $|x|_v$ is bounded below at each infinite place.
Moreover, since $\mathfrak{o}_F^\times$ is a cocompact subgroup of the elements of $F_\infty^\times$ with norm 1,
we can choose a representative for the principal ideal $(c)$ so the same is true of $|c|_v$.
Note (cf. (8.6)) that $\|(x_vc,\ ct_v + d)\|_v \asymp (|x_vc|_v + |ct_v + d|_v)$.

So in fact, again decreasing Y as necessary, it will suffice to estimate

(8.15)   $\sum_{(c) \in \mathcal{P} :\ N(c) \le Y^{-1/2}|x|_{\mathbb{A}}^{-1/2}} \#\{d \in \mathfrak{J}_i : \prod_{v | \infty} (1 + |ct + d|_v)^2 \le Y^{-1}|x|_{\mathbb{A}}\}.$

To estimate the right-hand side, first observe that if $\{M_v\}_{v | \infty}$ is any set of
positive real numbers indexed by the infinite places of F, then $\#\{d \in \mathfrak{J}_i : |ct + d|_v \le
M_v \text{ for } v | \infty\} \ll \prod_{v | \infty} (1 + M_v)$. Indeed, by subtraction, it will suffice to estimate
$\#\{d \in \mathfrak{J}_i : |d|_v \le 2M_v \text{ for } v | \infty\}$; this amounts to counting points in the lattice
$\mathfrak{J}_i \subset F_\infty$ in a region that is the product of a box and a disc; the result is then clear.

Next, if $T \ge 1$, then the subset $\{(y_1, \ldots, y_d) : \prod_i (1 + y_i) \le T\}$ in $\mathbb{R}_{\ge 0}^d$ is contained
in the union of $O_\epsilon(T^\epsilon)$ boxes $\{(y_1, \ldots, y_d) : y_i \le M_i\}$, where $\prod_i (1 + M_i) \ll T$. We
may assume $Y^{-1}|x|_{\mathbb{A}} \ge 1$, else (8.15) has no solutions. We conclude that the
number of d attached to each principal ideal $(c)$ in (8.15) is $\ll_\epsilon (Y^{-1/2}|x|_{\mathbb{A}}^{1/2})^{1+\epsilon}$.

The number of possibilities for $(c)$ is bounded by the number of integral ideals
with norm $\le Y^{-1/2}|x|_{\mathbb{A}}^{-1/2}$, which is $\ll_\epsilon (Y^{-1/2}|x|_{\mathbb{A}}^{-1/2})^{1+\epsilon}$. Finally there is one
class with $c = 0$. We conclude that the number of pairs $(c, d)$ up to equivalence is
$\ll Y^{-1-\epsilon} + 1$. This proves (8.10).



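As for (8.11), suppose $g \in \mathfrak{S}(T)$, so we may write $g = \begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}\begin{pmatrix} x & 0 \\ 0 & y \end{pmatrix}k$ with $k \in
K_\infty \times K_{\max}$, and $|xy^{-1}|_{\mathbb{A}} \ge T$. Suppose $\gamma = \begin{pmatrix} \alpha & \beta \\ \alpha' & \beta' \end{pmatrix}$. If $\gamma \notin B(F)$ then
$\alpha' \ne 0$. In that case, following the notation of (8.5), we have:

$\prod_v \|(\alpha'_v, \beta'_v)g_v\|_v \ge \prod_v |\alpha'_vx_v|_v = |x|_{\mathbb{A}},$

and therefore, by (8.5), $\mathrm{ht}(\gamma g) \le |\det(g)x^{-2}|_{\mathbb{A}} \le T^{-1}$. □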

9. Background on quantitative equidistribution results. 

The aim of this section is to quantify various standard equidistribution results 
(equidistribution of long horocycles, Hecke points, etc.), using the adelic Sobolev 
norms. As such neither the results nor the methods are new; we just collect together 
those results we need and provide brief proofs. 

As regards the origin of the ideas used here, we have drawn in particular from 
the work of Clozel-Ullmo, Linnik, Oh, Margulis, Ratner and Sarnak. 

9.1. Decay of matrix coefficients. 

9.1.1. Local setting. Our fundamental tool in establishing all these results is the 
spectral gap, i.e., quantitative mixing properties of real and p-adic flow. As such, 
we begin by recalling the basic relevant bound on matrix coefficients. 

Let $0 \le \alpha \le 1/2$. Let v be a place of F, and suppose that $(V, \pi)$ is a unitary
representation of $\mathrm{GL}_2(F_v)$ which does not contain, in its spectral decomposition,
any complementary series with parameter $\ge \alpha$. (More formally: V does not weakly
contain such a representation.) Thus $\alpha = 0$ corresponds to V being tempered, and
$\alpha = 1/2$ corresponds to V having no almost invariant vectors.

Then for $w_1, w_2$ any two $K_v$-finite elements of V, satisfying $\langle w_1, w_1\rangle = \langle w_2, w_2\rangle =
1$, and any $x \in F_v^\times$ we have the bound on matrix coefficients given by

(9.1)   $\langle \pi(a(x))w_1, w_2\rangle \ll_\epsilon \dim(K_vw_1)^{1/2}\dim(K_vw_2)^{1/2}(1 + |x|_v)^{\alpha - 1/2 + \epsilon}.$

The implicit constant of (9.1) depends only on $\epsilon$. Since we do not know of an
available reference, we briefly sketch an argument for (9.1). In the case where
$\alpha = 0$, i.e. V is tempered, then (9.1) is proven in [8]. In the general case, let
$(\sigma_{1/2-\alpha}, W)$ be the complementary series with parameter $1/2 - \alpha$; let $v^\circ \in W$ be
a unitary spherical vector. Then the representation $V \otimes W$ is tempered. Indeed
it suffices - again by [8] - to verify that a dense set of matrix coefficients are in
$L^{2+\epsilon}$, which follows by direct computation. Now one may estimate the matrix
coefficient $\langle a(x)w_1 \otimes v^\circ, w_2 \otimes v^\circ\rangle$ by appealing again to [8]. On the other hand
$\langle a(x)w_1 \otimes v^\circ, w_2 \otimes v^\circ\rangle = \langle a(x)w_1, w_2\rangle\langle a(x)v^\circ, v^\circ\rangle$, and an easy computation shows
that $\langle a(x)v^\circ, v^\circ\rangle \gg_\epsilon (1 + |x|_v)^{-\alpha-\epsilon}$. Thus (9.1) follows. (This argument is a variant
of an argument that appears at the end of 

Let us record a useful further variant. Suppose v is finite. Let $K_1, K_2 \subset K_v$ be
subgroups and let $\sigma$ be the $(K_1, K_2)$-bi-invariant probability measure supported on
$K_1a(x)K_2$.

Then



(9.2) \\v*ah< [Kv-.K^Kv-.K^il + ^y- 1 ' 2 ^ 



Indeed, for i = 1, 2 let 11^ be the projection operator w i— > J K kw on V, where Ki 
is endowed with the Haar probability measure. Then: 

(v-kcr,w) 

(9.3) i>*cr2 = sup — n — n 

wev IM| 2 

(a(x)U Kl V,U K2 w) V\X/2\v , I I Na-l/2+ell II 

= sup — < [K v : Ki ' [K v : K 2 \ 1 (1 + \x\ v ) 1 + \\v\\ 2 

wev \\w\\2 
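
The passage from (9.1) to the last inequality of (9.3) uses a dimension count that may be worth spelling out. The following is a sketch, assuming that $K_1, K_2$ have finite index in $K_v$ (otherwise the right-hand side of (9.2) is infinite and there is nothing to prove).

```latex
% If u is fixed by K_i, then the span of K_v u is spanned by the vectors k u
% with k running over coset representatives of K_v / K_i, hence
\dim(K_v u) \le [K_v : K_i].
% Apply (9.1) to the unit vectors \Pi_{K_1} v / \|\Pi_{K_1} v\|_2 and
% \Pi_{K_2} w / \|\Pi_{K_2} w\|_2, and use \|\Pi_{K_i} u\|_2 \le \|u\|_2:
\langle a(x) \Pi_{K_1} v, \Pi_{K_2} w \rangle
   \ll_\epsilon [K_v : K_1]^{1/2} [K_v : K_2]^{1/2}
      (1 + |x|_v)^{\alpha - 1/2 + \epsilon}\, \|v\|_2\, \|w\|_2 .
```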

9.1.2. Variant for $\Gamma\backslash\mathrm{SL}_2(\mathbb{R})$. Let $0 \le \alpha < 1/2$, suppose $G = \mathrm{SL}_2(\mathbb{R})$, and let $V$ be a
unitary representation of $G$ such that $V$ does not weakly contain any complementary
series with parameter $> \alpha$. The normalization is again so that $\alpha = 0$ corresponds
to tempered and $\alpha = 1/2$ corresponds to $V$ not having almost invariant vectors.

Then one has the following variant of (9.1), proved by the same method:

(9.4)  $\left\langle \pi\begin{pmatrix} y^{1/2} & 0 \\ 0 & y^{-1/2} \end{pmatrix} w_1, w_2 \right\rangle \ll_\epsilon \dim(\mathrm{SO}(2) \cdot w_1)^{1/2} \dim(\mathrm{SO}(2) \cdot w_2)^{1/2} (1 + |y|)^{\alpha - 1/2 + \epsilon}.$

It is convenient to extend the validity of (9.4) beyond the $K$-finite space by
replacing $\dim(\mathrm{SO}(2) w_i)$ by appropriate Sobolev norms. We confine ourselves to
the case of main interest, where $V$ is the orthogonal complement of the constants
in $L^2(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))$, where $\Gamma$ is a lattice in $\mathrm{SL}_2(\mathbb{R})$. The estimates we are about to
describe are, again, not new; estimates for effective mixing of geodesic and horocycle
flows in this setting are contained in [27].

For our purposes it would be optimal to use fractional Sobolev norms; since we 
have not defined these, we shall use a rather crude form of interpolation instead. 

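Thus let $f_1, f_2 \in C^\infty(\Gamma\backslash\mathrm{SL}_2(\mathbb{R}))$. One expands both $f_1$ and $f_2$ into a sum of
$\mathrm{SO}(2)$-types and applies (9.4). Indeed, write for $i \in \{1, 2\}$ an expansion
$f_i = \sum_{n=-\infty}^{\infty} f_i^{(n)}$, where $f_i^{(n)}$ transforms under the character
$\begin{pmatrix} \cos(\theta) & \sin(\theta) \\ -\sin(\theta) & \cos(\theta) \end{pmatrix} \mapsto e^{in\theta}$. Expanding:

$\left\langle \begin{pmatrix} y^{1/2} & 0 \\ 0 & y^{-1/2} \end{pmatrix} f_1, f_2 \right\rangle \ll_\epsilon (1 + |y|)^{\alpha - 1/2 + \epsilon} \sum_{n,m} \|f_1^{(n)}\|_2 \|f_2^{(m)}\|_2 = (1 + |y|)^{\alpha - 1/2 + \epsilon} \Big( \sum_n \|f_1^{(n)}\|_2 \Big) \Big( \sum_m \|f_2^{(m)}\|_2 \Big).$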

Our definitions of the Sobolev norms (Sec. 2.9.2) are so that $S_{2,1}(f_1)^2 \gg \sum_n (1 + |n|)^2 \|f_1^{(n)}\|_2^2$,
and similarly for $f_2$. On the other hand, it is an elementary estimate that

$\sum_n \|f^{(n)}\|_2 \ll_\epsilon \Big( \sum_n (1 + |n|)^2 \|f^{(n)}\|_2^2 \Big)^{1/4 + \epsilon} \Big( \sum_n \|f^{(n)}\|_2^2 \Big)^{1/4 - \epsilon}.$

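It follows from this that for any $k, k' \in \mathrm{SO}(2)$ we have the matrix coefficient
bound:

(9.6)  $\left| \left\langle k \begin{pmatrix} y^{1/2} & 0 \\ 0 & y^{-1/2} \end{pmatrix} k' f_1, f_2 \right\rangle \right| \ll_\epsilon (1 + |y|)^{\alpha - 1/2 + \epsilon} \big( S_{2,1}(f_1) S_{2,1}(f_2) \big)^{1/2 + \epsilon} \|f_1\|_2^{1/2 - \epsilon} \|f_2\|_2^{1/2 - \epsilon},$

at least for $f_1, f_2$ which are $\mathrm{SO}(2)$-finite. But the general case of smooth $f_1, f_2$
follows from density.

Note that in (9.6) the factor $\|f_1\|_2^{1/2 - \epsilon} S_{2,1}(f_1)^{1/2 + \epsilon}$ is a crude substitute for
the fractional $(1/2 + \epsilon)$ Sobolev norm of $f_1$.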
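
One way to obtain an interpolation estimate of the kind used above is Cauchy-Schwarz followed by Hölder. The following is a sketch of that route (not necessarily the argument the author had in mind), with the shorthand $a_n := \|f^{(n)}\|_2$ and an auxiliary parameter $0 < \delta < 1/2$, both introduced here for illustration.

```latex
% Step 1: Cauchy--Schwarz against the summable weight (1+|n|)^{-1-2\delta}.
\sum_n a_n \le \Big( \sum_n (1+|n|)^{-1-2\delta} \Big)^{1/2}
               \Big( \sum_n (1+|n|)^{1+2\delta} a_n^2 \Big)^{1/2}.
% Step 2: H\"older with exponents 2/(1+2\delta) and 2/(1-2\delta), writing
% (1+|n|)^{1+2\delta} a_n^2
%    = \big((1+|n|)^2 a_n^2\big)^{(1+2\delta)/2} \big(a_n^2\big)^{(1-2\delta)/2}.
\sum_n (1+|n|)^{1+2\delta} a_n^2
   \le \Big( \sum_n (1+|n|)^2 a_n^2 \Big)^{(1+2\delta)/2}
       \Big( \sum_n a_n^2 \Big)^{(1-2\delta)/2}.
% Step 3: combine, using \sum_n a_n^2 = \|f\|_2^2 (orthogonality of the
% SO(2)-types) and \sum_n (1+|n|)^2 a_n^2 \ll S_{2,1}(f)^2.
\sum_n a_n \ll_\delta S_{2,1}(f)^{1/2+\delta}\, \|f\|_2^{1/2-\delta}.
```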

9.2. Pointwise bounds. In this section, we make free use of the adelic Sobolev
norms introduced in Sec. 2.9.3. We recall the definition $S_{p,d} := S_{p,d,1/p}$. We also
recall that in statements of the form $|L(f)| \ll S_{p,d}(f)$, for certain linear functionals
$L$, we shall allow the implicit constant of $\ll$ to depend on $p$ and $d$ without explicit
mention.

Lemma 9.1. Let $f \in C^\infty(X_{\mathrm{GL}(2)})$ and let $x \in X_{\mathrm{GL}(2)}$. Then, for any $p \ge 2$ and
$d \gg 1$,

(9.7)  $|f(x)| \ll \mathrm{ht}(x)^{1/p} S_{p,d}(f).$

Moreover, if $F \in C^\infty(X \times X)$, and $p > 2$, $d \gg 1$,

(9.8)  $\int_X |F(x, x)| \, dx \ll S_{p,d}(F).$


Proof. As in (2.1), set $K_f = \prod_{v \text{ finite}} K_{v,f}$, where $K_{v,f}$ is the stabilizer of $f$ in
$K_v$. Fix an open neighbourhood of the identity $U \subset \mathrm{GL}_2(F_\infty)$. Consider the map
$\Pi : U \cdot K_f \to X$ defined by $(u, k) \mapsto xuk$. By Lem. 8.6 the fibers are unions of at
most $O(\mathrm{ht}(x))$ sets, each of the form $y Z(\mathbb{A}_F) \cap U K_f$. Moreover, for any $y \in U \cdot K_f$,
the measure of $\{z \in \mathbb{A}_F^\times : y\, a(z, z) \in U \cdot K_f\}$ is bounded above by a constant
depending only on $U$. Indeed, the set of such $z$ is contained in a fixed compact
subset of $\mathbb{A}_F^\times$ that depends only on $U$.

Equip $U \cdot K_f$ with the restriction of Haar measure from $\mathrm{GL}_2(\mathbb{A}_F)$. From the
preceding paragraph, one easily deduces that the push-forward of this measure to
$X$, under $(u, k) \mapsto xuk$, is bounded above by $C \cdot \mathrm{ht}(x)$ times the measure on $X$,
where the constant $C$ depends only on $U$.

Then:

(9.9)  $\int_U |f(xu)|^p \, du = \mathrm{vol}(K_f)^{-1} \int_{U \times K_f} |f(xuk)|^p \, du \, dk \ll \mathrm{ht}(x) [K_{\max} : K_f] \int_X |f(x)|^p \, d\mu_X(x)$

holds with $f$ replaced by $\mathcal{D}f$, for $\mathcal{D}$ any fixed monomial in $\mathrm{Lie}(\mathrm{GL}_2(F_\infty))$. The
standard Sobolev estimate, applied to the function $u \mapsto f(xu)$ on the real manifold
$U$, implies that $|f(x)| \ll \mathrm{ht}(x)^{1/p} PS_{p,d,1/p}(f)$ for sufficiently large $d$. (Indeed, it
suffices to take any $d > \dim(U)/2 = 2[F : \mathbb{Q}]$.) Then Remark IO © implies the
conclusion.

As for the second conclusion, we proceed in a similar fashion as above (with $X$
replaced by $X \times X$) to obtain the estimate $|F(x, y)| \ll \mathrm{ht}(x)^{1/p} \mathrm{ht}(y)^{1/p} S_{p,d}(F)$. It
is easy to see that $\int_X \mathrm{ht}(x)^{2/p} \, dx < \infty$ for $p > 2$, and the conclusion follows. □
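
The role of the condition $p > 2$ can be seen in the simplest classical model. The computation below is only a heuristic sketch: it assumes $X = \mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}$ with invariant measure $dx\,dy/y^2$ and $\mathrm{ht}(x + iy) \asymp y$ high in the cusp, which is an illustrative stand-in for the adelic quotient considered in the text.

```latex
% Contribution of the cusp region y \ge 1 (the rest of the fundamental domain
% is compact and contributes O(1)):
\int_{1}^{\infty} y^{2/p}\, \frac{dy}{y^{2}}
   = \int_{1}^{\infty} y^{2/p - 2}\, dy
   = \frac{1}{1 - 2/p} < \infty
   \qquad \text{for } p > 2,
% whereas for p = 2 the integrand is y^{-1} and the integral diverges
% logarithmically.
```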

The next lemma quantifies the rapid decay of a cuspidal function, or more generally
a truncated automorphic function, in the cusp. Recall that for $T_0 > 0$ we
have defined the Siegel domain $\mathfrak{S}(T_0)$ in Section 15^1

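Lemma 9.2. Let $f \in C^\infty(X_{\mathrm{GL}(2)})$. Put $f_N(g) = \int_{N(F)\backslash N(\mathbb{A}_F)} f(ng) \, dn$, where the
measure on $N(F)\backslash N(\mathbb{A}_F)$ is the $N(\mathbb{A}_F)$-invariant probability measure. Then for
$x \in \mathfrak{S}(T_0)$, $p \ge 2$, $k \ge 0$ and $d \gg 1$,

(9.10)  $|f(x) - f_N(x)| \ll_{T_0} \mathrm{ht}(x)^{1/p - k} S_{p,d,1/p+k}(f).$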

Proof. We may assume that $x \in N(\mathbb{A}_F) A(\mathbb{R}) \Omega (K_\infty \times K_{\max})$, for some fixed compact
set $\Omega \subset A(\mathbb{A}_F)$. Here $A(\mathbb{R})$ is regarded as a subset of $A(F_\infty)$ via the natural
inclusion $\mathbb{R} \hookrightarrow F_\infty$. Write accordingly $x = n a \omega k$, where $\omega \in \Omega$.

Consider the function on $F_\infty$ defined by $g(t) = f(n(t)x) - f_N(x)$. It is invariant
by the lattice $\Lambda = \{t \in \mathfrak{o}_F : n(t) \in \mathrm{GL}_2(F_\infty)\, \omega K_f \omega^{-1}\}$, where $K_f$ is again as in
(2.1). One sees that, since $\omega$ belongs to the fixed compact $\Omega$, the covolume satisfies the bound
$\mathrm{vol}(F_\infty/\Lambda) \ll [K_{\max} : K_f]$. Moreover, since $\Lambda$ may be regarded as a fractional
ideal of $\mathfrak{o}_F$, the homothety class of $\Lambda$ lies in a fixed compact set in the space of
homothety classes of lattices in $F_\infty$. Also, $g(t)$ defines a function on $F_\infty/\Lambda$, with
integral 0.

Suppose now that $G$ is a smooth function on $\mathbb{R}^d/L$, for some $d \ge 1$ and some
lattice $L \subset \mathbb{R}^d$, with integral 0. Let $\|G\|_{(i)} = \sup_{z \in \mathbb{R}^d/L} |\mathcal{D}G|$, where $\mathcal{D}$ varies over
all monomials in $\partial_1, \ldots, \partial_d$ of exact order $i$. Then an elementary argument shows
that $\|G\|_{(0)} \ll \mathrm{vol}(\mathbb{R}^d/L)^{i/d} \|G\|_{(i)}$, and the implicit constant may be taken to vary
continuously with the homothety class of $L$.

Apply this lemma to the function $g$ on $F_\infty/\Lambda$, with $i = k[F : \mathbb{Q}]$ for some $k \ge 1$.
The norm $\|g\|_{(k[F:\mathbb{Q}])}$, in the sense of the above paragraph, is bounded, by Lem. 9.1
and an elementary computation, by $\mathrm{ht}(x)^{-k} (\mathrm{ht}(x)^{1/p} PS_{p,d'}(f))$, for some $d' \gg 1$.
It follows that

$\sup |g(t)| \ll \mathrm{ht}(x)^{1/p - k} PS_{p,d'}(f) [K_{\max} : K_f]^k = \mathrm{ht}(x)^{1/p - k} PS_{p,d',1/p+k}(f).$

Applying Rem. O ©, we conclude $|f(x) - f_N(x)| \ll \mathrm{ht}(x)^{1/p - k} S_{p,d',1/p+k}(f)$. □

Lemma 9.3. Suppose $f \in C^\infty(X_{\mathrm{GL}(2)})$ is cuspidal. Then $S_{\infty,d,\beta}(f) \ll S_{2,d',\beta+3/2}(f)$,
for sufficiently large $d'$.

Proof. By Lem. 9.2 for $f$ cuspidal, applied with $p = 2$, $k = 1$, we see that $|f(x)| \ll
S_{2,d,3/2}(f)$ for $d \gg 1$. Applying this inequality to $\mathcal{D}f$, for $\mathcal{D}$ in the universal
enveloping algebra of $\mathrm{GL}_2(F_\infty)$, we see that $PS_{\infty,d,\beta}(f) \ll PS_{2,d',\beta+3/2}(f)$, for $d'$
sufficiently large. This inequality holds for cuspidal $f$.

Let $\Pi$ be the $L^2$-orthogonal projection onto the space of cuspidal functions; then
$\Pi$ commutes with $\mathrm{GL}_2(\mathbb{A}_F)$, and it follows that

$PS_{\infty,d,\beta}(\Pi f) \ll PS_{2,d',\beta+3/2}(\Pi f) \ll PS_{2,d',\beta+3/2}(f)$

for arbitrary $f \in C^\infty(X_{\mathrm{GL}(2)})$. Now Rem. 18.11 |J5J (or, more precisely, a trivial
modification thereof) implies the conclusion. □
9.3. Equidistribution of long horocycles and closed horospheres. Let $G$ be
a semisimple group, $\Gamma \subset G$ a lattice, $U$ a unipotent subgroup of $G$. It is well-known
that one can prove, in a quantitative fashion, the equidistribution of $U$-orbits on
$\Gamma\backslash G$ if $U$ is a horospherical subgroup, i.e. the unipotent radical of a proper parabolic
subgroup. We shall quantify two instances of this that will be of interest to us.

We emphasize that neither the results nor the techniques of this section are new;
we have included proofs only to keep the present paper as self-contained as possible.

Effective estimates for equidistribution of long horocycles on quotients of $\mathrm{SL}_2(\mathbb{R})$
are already implicit in the work of Ratner, [25] and [26], where the effective mixing
of the horocycle flow is used. We will proceed in a closely related fashion, using the
mixing property of the Cartan action; again, this is definitely not new and appears
already, although in a different context, in the doctoral thesis of Margulis (reprinted
in ED).

9.3.1. Equidistribution of long horocycles in hyperbolic 2-space. Let $\Gamma \subset \mathrm{SL}(2, \mathbb{R})$
be a lattice such that $L^2(\Gamma\backslash\mathrm{SL}(2, \mathbb{R}))$ does not contain any complementary series
representation with parameter $> \alpha$, for some $0 \le \alpha < 1/2$. (That is: $\alpha \in [0, 1/2)$
is such that all nonzero eigenvalues of the hyperbolic Laplacian $-y^2(\partial_{xx} + \partial_{yy})$ on
$\Gamma\backslash\mathbb{H}^2$ are bounded below by $1/4 - \alpha^2$.)
We define $n, a, \bar{n}$ as in (3.1).

The following Lemma quantifies the equidistribution of long horocycles. Results
of this type are already implicit in [25] and [26]. This problem is analyzed in much
more detail than we go into, in [35] and [11].

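Lemma 9.4. Assume $\Gamma$ is cocompact, and let $x_0 \in \Gamma\backslash\mathrm{SL}_2(\mathbb{R})$. Then

(9.11)  $\left| \frac{1}{T} \int_{t=0}^{T} f(x_0 n(t)) \, dt - \int_{\Gamma\backslash\mathrm{SL}_2(\mathbb{R})} f(g) \, dg \right| \ll_\epsilon T^{\frac{\alpha - 1/2}{2} + \epsilon} S_{\infty,1}(f).$
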

Proof. The idea (which is certainly not new - cf. remarks at start of Section 9.3)
is that, upon flowing a small ball in $\Gamma\backslash G$ for a long time by the geodesic flow, it
turns into a narrow neighbourhood of a long horocycle. One thereby can deduce the
equidistribution of the long horocycle from the mixing properties of the geodesic
flow.

Let $N, A, \bar{N}$ be the images of $n, a, \bar{n}$ respectively. Let $g_1$ be a smooth function
of compact support on the real line, with integral $\int_{-\infty}^{\infty} g_1(x) \, dx = 1$. It will remain
fixed throughout our arguments. Fix $1 > \delta > 0$ and let $g_\delta : \mathbb{R} \to \mathbb{R}$ be
the convolution of the characteristic function of $[0, 1]$ with $g_1(x/\delta)\delta^{-1}$; that is to
say

$g_\delta(x) = \delta^{-1} \int_{t=0}^{1} g_1\!\left( \frac{x - t}{\delta} \right) dt.$

Then $g_\delta$ is a smooth function of integral 1, which is supported in a small interval
around $[0, 1]$.

Define a probability measure $\mu_\delta$ on $\Gamma\backslash\mathrm{SL}_2(\mathbb{R})$ via the rule

$\mu_\delta(f) = \delta^{-1} \int f(x_0 n(x) a(e^y) \bar{n}(z)) \, g_\delta(x) g_1(y/\delta) g_1(z) \, dx \, dy \, dz.$

In words, $\mu_\delta$ is a measure supported on a small box around $x_0$; this box has width
$O(1)$ in the $N$ and $\bar{N}$ directions, and $O(\delta)$ in the $A$ direction. When we flow this
by $A$, it will become a measure supported along a box that closely approximates
an $N$-orbit.

We observe that

$\mu_\delta(a(T)^{-1} f) = \delta^{-1} \int f(x_0 a(T)^{-1} n(x) a(e^y) \bar{n}(z)) \, g_\delta(x/T) g_1(y/\delta) g_1(Tz) \, dx \, dy \, dz.$



On the other hand, for any fixed $x_1 \in \Gamma\backslash\mathrm{SL}_2(\mathbb{R})$, we note that

(9.12)  $\left| \delta^{-1} T \int f(x_1 a(e^y) \bar{n}(z)) \, g_1(y/\delta) g_1(Tz) \, dy \, dz - f(x_1) \right| \ll \max(T^{-1}, \delta) S_{\infty,1}(f).$

Indeed, (9.12) merely quantifies the fact that the integral in (9.12) is against
a probability measure supported in a very small ball (of size $\max(\delta, T^{-1})$) around
$x_1$. Since $\Gamma\backslash\mathrm{SL}_2(\mathbb{R})$ is assumed compact, the implicit constant of (9.12) may be
taken independent of $x_1$.

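Consequently,

(9.13)  $\left| \frac{1}{T} \int_{t \in \mathbb{R}} g_\delta(t/T) f(x_0 a(T)^{-1} n(t)) \, dt - \mu_\delta(a(T)^{-1} f) \right| \ll \max(T^{-1}, \delta) S_{\infty,1}(f).$

On the other hand, the measure $\mu_\delta$ has a continuous distribution function $h_\delta$, i.e.
$\mu_\delta(f) = \int_{\Gamma\backslash\mathrm{SL}_2(\mathbb{R})} f(g) \cdot h_\delta(g) \, dg$, and $\mu_\delta(a(T)^{-1} f)$ may be estimated using (9.6),
i.e. the decay of matrix coefficients.

A routine computation shows that $\|h_\delta\|_{L^2} \ll \delta^{-1/2}$ and $S_{2,1}(h_\delta) \ll \delta^{-3/2}$; on
account of the cocompactness of $\Gamma\backslash\mathrm{SL}_2(\mathbb{R})$, both these estimates are uniform
in $x_0$.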

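Using (9.6) now yields:

(9.14)  $\left| \mu_\delta(a(T)^{-1} f) - \int_{\Gamma\backslash\mathrm{SL}_2(\mathbb{R})} f(g) \, dg \right| \ll_\epsilon T^{\alpha - 1/2} S_{2,1}(f) \delta^{-1-\epsilon}.$

Finally, note that if $\chi_{[0,T]}$ denotes the characteristic function of $[0, T]$ in the real
line, then $\frac{1}{T} \int_{\mathbb{R}} |g_\delta(t/T) - \chi_{[0,T]}(t)| \, dt \ll \delta$. It follows that

(9.15)  $\left| \frac{1}{T} \int_0^T f(x_0 a(T)^{-1} n(t)) \, dt - \frac{1}{T} \int_{\mathbb{R}} g_\delta(t/T) f(x_0 a(T)^{-1} n(t)) \, dt \right| \ll \delta \cdot S_{\infty,0}(f).$

Combining (9.13), (9.14) and (9.15), and replacing $x_0$ by $x_0 a(T)$, we conclude
that the left hand side of (9.11) is bounded by

$O_\epsilon\!\left( S_{\infty,1}(f) \left( T^{-1} + T^{\alpha - 1/2} \delta^{-1-\epsilon} + \delta \right) \right).$

We choose $\delta^2 = T^{\alpha - 1/2}$ to obtain the claimed conclusion. □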
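
The effect of the choice $\delta^2 = T^{\alpha - 1/2}$ can be checked by balancing the two $\delta$-dependent error terms. The computation below is a sketch under the assumption $T \ge 1$, with $\epsilon$ an arbitrarily small positive quantity.

```latex
% With \delta = T^{(\alpha - 1/2)/2} = T^{-(1/2-\alpha)/2}:
T^{\alpha - 1/2}\, \delta^{-1-\epsilon}
   = T^{(\alpha - 1/2)\left( 1 - \frac{1+\epsilon}{2} \right)}
   = T^{\frac{\alpha - 1/2}{2}(1 - \epsilon)}
   \le T^{\frac{\alpha - 1/2}{2} + \epsilon}.
% Since 0 \le \alpha < 1/2 we have \delta \ge T^{-1/4} \ge T^{-1}, so the
% terms T^{-1} and \delta are both at most T^{\frac{\alpha - 1/2}{2}}, and the
% total error is \ll_\epsilon T^{\frac{\alpha - 1/2}{2} + \epsilon} S_{\infty,1}(f).
```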

9.3.2. Equidistribution of large horospheres on higher rank groups. We now prove
quantitative equidistribution of large closed horospheres. This result is well-known
and generalizes the result of Sarnak, that the closed horocycle $\{x + iy\}_{0 \le x \le 1}$ is
equidistributed in $\mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}$, as $y \to 0$.

We shall follow the notation of Sec. 3.2 which we briefly reprise. Let $G$ be
a connected semisimple (real) Lie group, $\Gamma \subset G$ a lattice, $K \subset G$ the maximal
compact subgroup, $\mathfrak{g}$ the Lie algebra of $G$, and $H \in \mathfrak{g}$ a semisimple element. Fix
arbitrarily a norm $\|\cdot\|$ on $\mathfrak{g}$. We equip $G$ with the Haar measure in which $\Gamma\backslash G$
has volume 1. Let $\exp : \mathfrak{g} \to G$ be the exponential map. Let $\mathfrak{u}$ be the sum of all
negative root spaces for $H$, and let $U = \exp(\mathfrak{u}) \subset G$. Let $x_0 \in \Gamma\backslash G$ be so that $x_0 U$
is compact; note that the existence of such $x_0$ implies that $\Gamma\backslash G$ is noncompact.

Let $x_T = x_0 \exp(TH)$, and let $\Delta_T$ be the stabilizer of $x_T$ in $U$. We denote by
$\langle \cdot, \cdot \rangle_{L^2(\Gamma\backslash G)}$ the inner product in the Hilbert space $L^2(\Gamma\backslash G)$.

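Lemma 9.5. There is $\kappa_1 > 0$ such that, for any $f, g \in C^\infty(\Gamma\backslash G)$ and for any
$U \in \mathfrak{u}$ with unit length (w.r.t. the fixed norm $\|\cdot\|$ on $\mathfrak{g}$) we have:

(9.16)
$\left| \langle \exp(tH) \cdot f, g \rangle - \int_{\Gamma\backslash G} f \int_{\Gamma\backslash G} \overline{g} \right| \ll \exp(-\kappa_1 |t|) \, S_{\infty,\dim(K)}(f) \, S_{\infty,\dim(K)}(g),$
$\left| \langle \exp(sU) \cdot f, g \rangle - \int_{\Gamma\backslash G} f \int_{\Gamma\backslash G} \overline{g} \right| \ll (1 + |s|)^{-\kappa_1} \, S_{\infty,\dim(K)}(f) \, S_{\infty,\dim(K)}(g).$

Of course the constant $\kappa_1$ will depend on the choice of the norm $\|\cdot\|$.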

Proof. This follows from a nice result of Kleinbock and Margulis: see [T^l- (The 
orthogonal complement L\ of the identity representation in L 2 (r\G) is isolated, 
by ^3 Thm 1.12], from the trivial representation in the unitary dual of G. A 
sufficiently high tensor power of L\ is therefore tempered, whereupon one applies 
the bounds of Note that ^SI only claims the result (in effect) with for 
some $d$; the fact that we can take $d = \dim(K)$ follows by explicating the argument
just sketched. □ 

Recall the definition of $\nu_T$ from Sec. 3.2, that is to say: $\nu_T(f) = \frac{\int_{\Delta_T\backslash U} f(x_T u) \, du}{\mathrm{vol}(\Delta_T\backslash U)}$.
Thus $\nu_T$ is the measure supported on a closed horosphere, and this horosphere
expands as $T \to \infty$. One deduces from Lem. 9.5 that the measures $\nu_T$ are
equidistributed as $T \to \infty$:

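Lemma 9.6. Set $\kappa_2 = \frac{\kappa_1}{\dim(G) + \dim(K) + 1}$, $\kappa_1$ being as in the previous Lemma. Then,
for $T > 0$ and $f \in C^\infty(\Gamma\backslash G)$,

$\left| \nu_T(f) - \int_{\Gamma\backslash G} f \right| \ll e^{-\kappa_2 T} S_{\infty,\dim(K)}(f).$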

Proof. The idea is identical to Lem. 9.4 and we refer to the first paragraph of that
proof for a description of it.

Fix a left-invariant Riemannian metric on $G$. This descends to a metric on
$\Gamma\backslash G$. We first choose some "smoothing kernels" on $G$. For each $\epsilon > 0$, choose a
function $k_\epsilon \in C^\infty(G)$ such that $k_\epsilon$ is positive, supported in an $\epsilon$-neighbourhood of
the identity, $\int_G k_\epsilon = 1$, and so that for any $X_1, X_2, \ldots, X_l \in \mathfrak{g}$ we have:

(9.17)  $\sup_{g \in G} |X_1 \cdots X_l \, k_\epsilon| \ll_{X_1, \ldots, X_l} \epsilon^{-l - \dim(G)}.$

It is easy to see this is possible (for example: choose an appropriate sequence of
functions on $\mathfrak{g}$ and transport to $G$ via the exponential map.)

The measure $\nu_0$ is a $U$-invariant probability measure supported on the closed
orbit $x_0 U$. $\nu_0 \star k_\epsilon$ is supported in an $\epsilon$-neighbourhood of $x_0 U$ and is given by
integration against a $C^\infty$ density function $g_\epsilon$, that is: $\nu_0 \star k_\epsilon(f) = \int_{\Gamma\backslash G} f g_\epsilon$.

Moreover, it follows from (9.17) that $g_\epsilon$ satisfies the bounds $S_{\infty,l}(g_\epsilon) \ll_l \epsilon^{-l - \dim(G)}$,
for any $l \ge 0$.
The translate of $\nu_0 \star k_\epsilon$ by $\exp(-TH)$ is supported in an $\epsilon$-neighbourhood of
$x_T U$; note it is essential that $T > 0$ for this. (Recall - Sec. 2.1 - our conventions
are such that the right translate of the point mass at $x$ by $g \in G$ is the point mass
at $xg^{-1}$.)

In fact, one verifies that

(9.18)  $|\nu_T(f) - \nu_0 \star k_\epsilon(\exp(TH) \cdot f)| \ll \epsilon \, S_{\infty,1}(f).$

(Indeed, let $g \in \mathrm{supp}(k_\epsilon)$ and let $\delta_g$ be the point mass at $g$. It suffices to check
that the identical bound holds for $|\nu_T(f) - \nu_0 \star \delta_g(\exp(TH) \cdot f)|$, which equals
$|\nu_T(f) - \nu_0(g \exp(TH) \cdot f)|$. Let $\mathfrak{b}$ be the sum of non-negative root spaces for $H$ on $\mathfrak{g}$.
If $\epsilon$ is sufficiently small, we may write $g = um$, with $u \in \exp(\mathfrak{u})$ and $m \in \exp(\mathfrak{b})$.
Moreover, again if $\epsilon$ is sufficiently small, $u, m$ lie in a $C\epsilon$-neighbourhood of the
identity, for some fixed constant $C$. Then $\exp(-TH) g \exp(TH) = u'm'$, with
$u' \in \exp(\mathfrak{u})$ and where $m' \in \exp(\mathfrak{b})$ is in a $C\epsilon$-neighbourhood of the identity, for
some absolute $C$. Also, $\nu_0(g \exp(TH) \cdot f) = \nu_T(m'f)$. Thus it suffices to bound
$|\nu_T(f) - \nu_T(m'f)|$. But the $L^\infty$ norm of $f - m' \cdot f$ is $\ll \epsilon S_{\infty,1}(f)$.)

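On the other hand, by Lem. 9.5, for $T > 0$ we have

(9.19)  $\left| \nu_0 \star k_\epsilon(\exp(TH) \cdot f) - \int_{\Gamma\backslash G} f \right| \ll \exp(-\kappa_1 T) \, S_{\infty,\dim(K)}(f) \, \epsilon^{-\dim(G) - \dim(K)}.$

It follows from this and (9.18) that

$\left| \nu_T(f) - \int_{\Gamma\backslash G} f \right| \ll \left( \epsilon + \exp(-\kappa_1 T) \epsilon^{-\dim(G) - \dim(K)} \right) S_{\infty,\dim(K)}(f).$

To conclude, take $\epsilon = \exp\!\left( -\frac{\kappa_1 T}{\dim(G) + \dim(K) + 1} \right)$. □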
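
The choice of $\epsilon$ at the end of the proof balances the two error terms exactly. A short verification, using the shorthand $D := \dim(G) + \dim(K)$ (introduced here only for readability):

```latex
% Choose
\epsilon = \exp\!\Big( -\frac{\kappa_1 T}{D + 1} \Big).
% Then
\exp(-\kappa_1 T)\, \epsilon^{-D}
   = \exp\!\Big( -\kappa_1 T + \frac{D\, \kappa_1 T}{D + 1} \Big)
   = \exp\!\Big( -\frac{\kappa_1 T}{D + 1} \Big)
   = \epsilon,
% so both terms equal e^{-\kappa_2 T} with \kappa_2 = \kappa_1 / (D + 1),
% which is the exponent appearing in Lemma 9.6.
```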

9.4. The equidistribution of Hecke orbits and p-adic horocycles. In this 
section, we prove some "p-adic" equidistribution statements, pertaining to the 
equidistribution of Hecke points and p-adic horocycles. 

In the Lemmas that follow, f will be a prime ideal of F, ~pp the normalized Hecke 
measure defined subsequent to (|2.6(l . and [f] as defined in Sec. 12.51 

The first Lemma is an adelic version of the fact that the Hecke orbit $T_q(z)$ of a
point $z \in \mathrm{SL}(2, \mathbb{Z})\backslash\mathbb{H}$ is equidistributed, as $q \to \infty$.

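Lemma 9.7. Let $f \in C^\infty(X_{\mathrm{GL}(2)})$ and $\mathfrak{f}$ an ideal of $F$. Then, for $x_0 \in X_{\mathrm{GL}(2)}$,
$d \gg 1$,

(9.20)  $\left| f \star \mu_\mathfrak{f}(x_0) - \sum_{\chi^2 = 1} \chi([\mathfrak{f}]) \chi(x_0) \int_X f(x) \chi(x) \, d\mu_X(x) \right| \ll_\epsilon N(\mathfrak{f})^{\alpha - 1/2 + \epsilon} \, \mathrm{ht}(x_0)^{1/2} \, S_{2,d}(f).$

Here $\chi(x)$ denotes the function $g \mapsto \chi(\det(g))$ on $X$.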

Proof. Let $\mathscr{P}$ be the projection defined in Sec. 2.7. Let $E$ be the endomorphism
$f \mapsto (f - \mathscr{P}f) \star \mu_\mathfrak{f}$ of $C^\infty(X_{\mathrm{GL}(2)})$. The operator $E$ has norm $\ll_\epsilon N(\mathfrak{f})^{\alpha - 1/2 + \epsilon}$
w.r.t. the $L^2$ norm (this follows from Lem. 2.1 and the bounds of Sec. 9.1). By
Lem. 8.3 it follows that the operator norm of $E$ w.r.t. $S_{2,d,0}$ is also $\ll_\epsilon N(\mathfrak{f})^{\alpha - 1/2 + \epsilon}$.

The left hand side of (9.20) is exactly $Ef(x_0)$. Now apply Lem. 9.1 with $p = 2$,
to conclude. □
The next Lemma is an adelic version of the following (again closely connected to
equidistribution of Hecke points). Let $Y(p)$ be embedded in $Y(1) \times Y(1)$ (notation
of discussion after Prop. 4.1). Then $Y(p)$ is equidistributed as $p \to \infty$. The
quantification of this is slightly complicated by noncompactness; in particular, we
must use Sobolev norms $S_{p,d}$ for $p > 2$. (Cf. discussion in Sec. 2.9.1.)

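Lemma 9.8. Let $\mathfrak{q}$ be a prime ideal of $\mathfrak{o}_F$. Let $F \in C^\infty(X \times X)$ be $\mathrm{PGL}_2(\mathfrak{o}_{F_\mathfrak{q}}) \times
\mathrm{PGL}_2(\mathfrak{o}_{F_\mathfrak{q}})$ invariant. Then, for any $d \gg 1$, $p > 2$,

(9.21)  $\left| \int_X F(x, x a([\mathfrak{q}])) \, dx - \sum_{\chi^2 = 1} \chi([\mathfrak{q}]) \int_{X \times X} F(x, y) \chi(x) \chi(y) \, d\mu_X(x) \, d\mu_X(y) \right| \ll_\epsilon N(\mathfrak{q})^{\frac{2\alpha - 1}{p} + \epsilon} S_{p,d}(F).$
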

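Proof. Let $\sigma$ be the measure $\delta_1 \times \mu_\mathfrak{q}$ on $\mathrm{PGL}_2(F_\mathfrak{q}) \times \mathrm{PGL}_2(F_\mathfrak{q})$, where $\delta_1$ is the
measure consisting of a point mass at the identity. Recalling (see (2.6) in the case of
a prime ideal, and Section 2.5 for the definition of $K_\mathfrak{q}$) that $\mu_\mathfrak{q}$ is the $K_\mathfrak{q}$-bi-invariant
probability measure supported on $K_\mathfrak{q} a([\mathfrak{q}]) K_\mathfrak{q}$, we note that

$(F \star \sigma)(x, x) = \int_{k_1, k_2 \in K_\mathfrak{q}} F(x, x k_1 a([\mathfrak{q}]) k_2) \, dk_1 \, dk_2 = \int_{K_\mathfrak{q}} F(xk, xk \, a([\mathfrak{q}])) \, dk,$

where we equip $K_\mathfrak{q}$ with the Haar measure of mass 1, and we use the $\mathrm{PGL}_2(\mathfrak{o}_{F_\mathfrak{q}})$-
invariance of $F$ at the second step. It follows that

(9.22)  $\int_X F(x, x a([\mathfrak{q}])) \, dx = \int_X (F \star \sigma)(x, x) \, dx.$

Let $\mathscr{P}_2$ be as in Sec. 2.7. Let $E$ be the endomorphism of $C^\infty(X \times X)$ defined
by $E(F) = (F - \mathscr{P}_2 F) \star \sigma$. Combining (9.22) and the easily verified equality

$\int_X (\mathscr{P}_2 F \star \sigma)(x, x) \, dx = \sum_{\chi^2 = 1} \chi([\mathfrak{q}]) \int_{X \times X} F(x, y) \chi(x) \chi(y) \, d\mu_X(x) \, d\mu_X(y),$

we see that the left hand side of (9.21) is precisely $\left| \int_X EF(x, x) \, dx \right|$.

Since $\mathscr{P}_2$ does not increase $L^\infty$ norms, and $\sigma$ is a probability measure, it follows
that the operator norm of $E$ w.r.t. the $L^\infty$ norm is $\le 2$. Moreover, the operator
norm of $E$ w.r.t. the $L^2$ norm is $\ll N(\mathfrak{q})^{\alpha - 1/2}$, as follows from Lem. 2.1.

Lem. 8.3 now implies that for $2 < p < \infty$ we have the majorization $S_{p,d}(EF) \ll_\epsilon
N(\mathfrak{q})^{\frac{2\alpha - 1}{p} + \epsilon} S_{p,d}(F)$. Now Lem. 9.1 shows that

$\left| \int_X EF(x, x) \, dx \right| \ll S_{p,d}(EF) \ll N(\mathfrak{q})^{\frac{2\alpha - 1}{p} + \epsilon} S_{p,d}(F)$

for $p > 2$, $d \gg 1$; whence the conclusion of the Lemma. □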
The next Lemma shows the equidistribution of certain p-adic horocycle orbits,
as $p$ varies. The idea will be as follows: (speaking very loosely, in the case of
$\mathrm{SL}_2$) a typical p-adic horocycle orbit, when projected to $\mathrm{SL}(2, \mathbb{Z})\backslash\mathbb{H}$, looks like
$\{z + \frac{i}{p}\}_{0 \le i \le p-1}$. This set looks very much like the image, under the p-Hecke operator, of
the point $pz$. Thus one can deduce distribution properties of the p-adic horocycle
orbit from some standard facts about Hecke operators.

This is a rather ad hoc argument. Let us say a few words about why this problem
does not quite fit into the usual setup of such questions. We are proving statements
about the distribution of e.g. p-adic horocycles when $p$ varies. This does not fit
easily into the usual context of such matters, where one considers e.g. a fixed
unipotent flow on an $S$-arithmetic homogeneous space. It would be interesting to
have a more conceptual and natural way of treating such questions, in the aspect
where "p varies."

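Lemma 9.9. Let $f \in C^\infty(X)$ and let $\mathfrak{f}$ be an integral ideal of $\mathfrak{o}_F$, factorizing as
$\mathfrak{f} = \prod_\mathfrak{q} \mathfrak{q}^{e_\mathfrak{q}}$. For each $\mathfrak{q} | \mathfrak{f}$, let $s_\mathfrak{q} \ge 0$ be a non-negative integer, and suppose $f$ is
invariant by $\prod_{\mathfrak{q} | \mathfrak{f}} K_0[\mathfrak{q}^{s_\mathfrak{q}}]$. Put $\mathfrak{m} = \prod_{\mathfrak{q} | \mathfrak{f}} \mathfrak{q}^{s_\mathfrak{q}}$. Let $\eta_\mathfrak{f}$ be the Haar probability measure
on $\prod_{\mathfrak{q} | \mathfrak{f}} N(\mathfrak{q}^{-e_\mathfrak{q}} \mathfrak{o}_\mathfrak{q})$ and $dh$ the Haar probability measure on $\mathrm{SL}_2(F)\backslash\mathrm{SL}_2(\mathbb{A}_F)$.
Then, for $y \in F^\times\backslash\mathbb{A}_F^\times$,

(9.23)  $\left| f \star \eta_\mathfrak{f}(a(y)) - \int_{h \in \mathrm{SL}_2(F)\backslash\mathrm{SL}_2(\mathbb{A}_F)} f(h a(y)) \, dh \right| \ll_\epsilon N(\mathfrak{f})^{\alpha - 1/2 + \epsilon} \max\!\left( N(\mathfrak{f})|y|, \, N(\mathfrak{f})^{-1}|y|^{-1} \right)^{1/2} N(\mathfrak{m})^{3/2 + \epsilon} S_{2,d}(f).$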

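Proof. As usual let $K_{v,f}$ be the stabilizer of $f$ in $K_v = \mathrm{GL}_2(\mathfrak{o}_v)$, so that $K_{\mathfrak{q},f}$
contains $K_0[\mathfrak{q}^{s_\mathfrak{q}}]$ for each $\mathfrak{q} | \mathfrak{f}$.

We now define a measure $\tilde{\eta}_\mathfrak{q}$ on $\mathrm{PGL}_2(F_\mathfrak{q})$ for each $\mathfrak{q} | \mathfrak{f}$. It will "approximate" $\eta_\mathfrak{f}$
but will be composed of Hecke operators.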

For those q such that s q = 0, put 



(9.24) T^Ntq)-^/^-^*^ ~N(q)^ i ( 5 a(ro -, q - 1) * / i qC ,- 1 . 

(We refer to Sec. 12.81 for definitions of pL-r appearing above.) For q such that 
s q > 1: we set (Tq to be the unique bi-jK'o [<T Sq ]-invariant probability measure on 
K [q Sq ]a(w e " )Ko{q Sq ], normalized to have mass 1, and we put ?7 q = 8 a ^-e q ) *cr q . 
Finally, set fj f = Ilq|f *7q- 

One then verifies by a direct computation that

(9.25)  $f \star \eta_\mathfrak{f} = f \star \tilde{\eta}_\mathfrak{f}.$

The intuition for this statement, in the classical setting, is as follows: let $z \in
\mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}$. Then (for a prime number $p$) the set $\{z + i/p\}_{0 \le i \le p-1}$ is the p-Hecke
orbit of $pz$, with the point $p^2 z$ removed. In the case $e_\mathfrak{q} = 1$, the first term on the
right hand side of (9.24) corresponds to the p-Hecke orbit of $pz$, and the second
term corresponds to removing the point $p^2 z$.
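
The classical picture can be made explicit. The following sketch assumes the usual description of the p-Hecke orbit of a point $w$ of $\mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}$, namely $pw$ together with the points $(w + j)/p$ for $0 \le j \le p - 1$; this convention is not spelled out in the text and is stated here as an assumption.

```latex
% Assumed convention: T_p(w) = \{ p w \} \cup \{ (w + j)/p : 0 \le j \le p-1 \}.
% Taking w = pz:
T_p(pz) = \{ p^2 z \} \cup \Big\{ \frac{pz + j}{p} : 0 \le j \le p-1 \Big\}
        = \{ p^2 z \} \cup \Big\{ z + \frac{j}{p} : 0 \le j \le p-1 \Big\},
% so removing the single point p^2 z leaves exactly the p points z + j/p.
```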

More formally, to verify (9.25), the unramified computation, at those places
where $s_\mathfrak{q} = 0$, is easy; the ramified computation is just Hecke theory at ramified
primes, see e.g. [33, Prop 3.33]. 20

The projection 3? of Sec. 12.71 commutes with the action of GL2(Af), and so 
(l9~!>5)) holds also with / replaced by & f or / - &f. Moreover, ^f-kijf = 9>f. It 



^For the unramified assertion, let B = PGL^-FqV-fCq and let xo g B be the identity coset. 
The set B has the structure of the vertices of a q v + 1-valent tree. Let Si be the set of all vertices 
at distance eq — 2i (some i > 0) from a(vj~ e i )xq. Let 52 be the set of all vertices at even distance 
< e q — 1 — 2i (some i > 0) from a(ro — e i ~ 1 )xq. Then S2 C Si and Si — S2 is precisely the 
n(q — e i )-orbit of xo. As for the ramified case: one notes that, if s q > 0, the Haar measure on 
-R"o[l s ''] is just the pushforward of the Haar measure on n(o q ) X 0(0^) X n(q 3 i ) by the product 
map (n, a, n) 1— > nan. 
follows that

(9.26)  $f \star \tilde{\eta}_\mathfrak{f}(x) = \int_{h \in \mathrm{SL}_2(F)\backslash\mathrm{SL}_2(\mathbb{A}_F)} f(hx) \, dh + (f - \mathscr{P}f) \star \tilde{\eta}_\mathfrak{f}(x).$


Set f = f — & f ■ Then, expanding the term f *fjf: 

(9.27) /*%= Yl f* II fy--*)*^* 

Sc{q|f,s,=0} q|f:a,>l 

JJ (N(q)- e ^ 2 5 Q(ro -e, , * Mq e q ) * [J (-Nfq)"^^-.,-!) * /i q 

q|f:s„=0,q£S qeS 

We now specialize to the case under consideration where x — a(y) for some 
y e Ap. For 5 c {q|f, s q = 0} set 

a s = II ^ II N(q)- e «/V« * II -N(q)-^/v,-i- 

q|f:s q >1 q|f:«„=0,qgS S(| =0,qeS 

With this notation, we have: 

(9.28) /*%(o(»))= /*^Uj/[fD n 

SC{q|f,Sq=0} \ s q =0 : qGS 

Now apply Lem. 19. II to see that, for any z S A J and d 3> 1, we have 

(9.29) f** s (a{z)) « max(|z|, |z|- 1 ) 1/2 PS 2 , d {f * *s), 

where we have used the easily verified fact that $\mathrm{ht}(a(z)) \asymp \max(|z|, |z|^{-1})$.
Now, for any / £ C°°(X), we have 

[K max : K f „ g ] <]J[K q : K Q [q s "]][K m ^ : K f ] « e N(m) 1+£ [if max : K f \. 

By the bounds on matrix coefficients (|9.2|) . and recalling that m = T^[ q |^ q s i , we 
compute that 

PS2, d (f*a S ) « e N(m) 3 / 2+£ (N(f) J[ N(q)- 1 )"- 1 / 2+£ [] N(q)- 1 P5 2 , d (/). 

qGS qeS 

Combining this with (|9.28|) and (|9.29|l . we find that for each S C {q : s q > 1} 
(9.30) 

|/*%(a( tf ))| « £ N(f) - 1 /a+ £ N(m) 3 / 2+6 max(N(f)|v| J N(f)- 1 |i/|- 1 ) 1 /3p5 2 ,«,Cf) 
This bound is valid for all / £ C°°(X), not merely those / that are invariant by 
Ilq|f -^o[q s "]- Apply Rem. 18.11 © to the endomorphism / i— ► /* rjjf ; this shows 
that l|9.30|l remains valid, for any / £ C°°(X), if we replace PS24 by S^.d on 
the right hand side. Now, specialize to the case where / € C°°(X) is actually 
Ilq|f ^o[q s, ]-invariant and apply (|9.2(il) to obtain the conclusion of the Lemma. □ 

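The Lemma that follows states an adelic version of the following fact: the measure
on $\mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}$ defined by $\nu_y := q^{-1} \sum_{0 \le x \le q-1} \delta_{x/q + iy}$ approximates the uniform
measure if $y \asymp q^{-1}$; more precisely we have an inequality that

$\left| \nu_y(f) - \int_{\mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}} f \right|$

is bounded by $\max(qy, \frac{1}{qy})^{1/2} q^{-\delta} S(f)$, where $S$ is an appropriate Sobolev norm and
$\delta > 0$.
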
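Lemma 9.10. Let $f \in C^\infty(X)$ and let notations be as in Sec. \@ (see esp. (6.4)).
In particular, $\mathfrak{f}$ is an integral ideal of $\mathfrak{o}_F$, $Q = N(\mathfrak{f})$, $[\mathfrak{f}]$ is as in and

$\nu_z(f) = \int f(a(y) n([\mathfrak{f}])) \, d^\times y.$

Suppose $f$ is invariant by $K_0[\mathfrak{q}^{s_\mathfrak{q}}]$, for each $\mathfrak{q} | \mathfrak{f}$, and put $\mathfrak{m} = \prod_{\mathfrak{q} | \mathfrak{f}} \mathfrak{q}^{s_\mathfrak{q}}$. Then

(9.31)  $\left| \nu_z(f) - \int_X f(x) \, d\mu_X(x) \right| \ll_\epsilon N(\mathfrak{f})^{\alpha - 1/2 + \epsilon} N(\mathfrak{m})^{3/2 + \epsilon} \max\!\left( N(\mathfrak{f}) z, \frac{1}{N(\mathfrak{f}) z} \right)^{1/2} S_{2,d}(f).$
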

Proof. For each q|f and integer < e £ Z, let 77 q e be the Haar probability measure 
on the group iV(q _e o q ). Then, since the assumption implies that / is right invariant 
by a q (o^), for each q dividing f, we see that for any x £ X: 



(9.32) / f(xa(y)n,(zu- e ))d x y= \ ^ f{xn^ym- e ))d*y 

/*(v-N(q)- 1 V-0(*) 



yeo Fci Jyeo f 



1-N(q)"i 



It follows that 



".(/) = IR 1 -Nfa)" 1 )" 1 ' / /*II(V q -N(q)- 1 V.- 1 )(o(l/))- 
We conclude by applying the previous Lemma. □ 



10. Background on Eisenstein series. 

This section essentially develops the theory of Eisenstein series on PGL2 over a 
number field. This is needed for the Rankin-Selberg method that we reprise in the 
next section, which in turn is used in the text to relate a period integral with an 
L-function. 

Let $Z$ be a topological space. In this section, we will often speak - in various
contexts, often with $Z = X$ or $\mathrm{GL}_2(\mathbb{A}_F)$ - of a function $F(s, z)$ on $\mathbb{C} \times Z$ being
"holomorphic" or "holomorphic in $s$." For the purposes of this document, this can
be assumed to mean that the function is jointly continuous and holomorphic in $s$ for
each $z$ individually.

Note that $s \mapsto \int_Z F(s, z) \, dz$, if absolutely convergent and uniformly so in $s$,
defines a holomorphic function. Indeed, it suffices to verify that its integral over a
closed curve in the $s$-variable is zero, which follows by Fubini's theorem.
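
The Fubini argument just mentioned is an application of Morera's theorem, and can be sketched as follows. Here $I(s)$ is a name introduced only for this illustration, and $\gamma$ is taken to be the boundary of a small triangle in the region where the integral converges absolutely and uniformly (continuity of $I$ follows from joint continuity of $F$ and the uniformity of convergence).

```latex
% I(s) := \int_Z F(s,z)\, dz.
\oint_\gamma I(s)\, ds
   = \oint_\gamma \int_Z F(s,z)\, dz\, ds
   = \int_Z \Big( \oint_\gamma F(s,z)\, ds \Big)\, dz
   = \int_Z 0\, dz
   = 0 .
% The interchange of integrals is Fubini's theorem; the inner integral
% vanishes by Cauchy's theorem because F(\cdot, z) is holomorphic for each z.
% Morera's theorem then shows that I is holomorphic.
```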

Similarly, we will say that F(s, z) is meromorphic if there exists a holomorphic 
function h(s) so that h(s)F(s, z) is holomorphic. 



10.1. Construction and basic properties of the Eisenstein series. We recall 
the Eisenstein series that we shall have need of and its basic properties, following 
Jacquet [U\ §19]. We will need Eisenstein series only on PGL 2 . 
10.1.1. Schwarz functions. Let $\Psi$ be a Schwarz-Bruhat function on $\mathbb{A}_F^2$, i.e. $\Psi$ is a
finite linear combination of functions $\prod_v \Psi_v$, where each $\Psi_v$ is locally constant of
compact support, for $v$ finite, is a Schwarz function on $F_v^2$ for $v$ infinite, and
is the characteristic function of $\mathfrak{o}_v^2$, for almost all $v$.

If $v$ is a real place, choose $a_v \in F_v$ so that $e_{F_v}(x) = e^{2\pi i a_v x}$, and say a
Schwarz function $\Psi_v$ on $F_v^2$ is standard if it is the product of a polynomial and
$e^{-\pi |a_v|_v (|x|_v^2 + |y|_v^2)}$. If $v$ is a complex place, choose $a_v \in F_v$ so that $e_{F_v}(x) =
e^{2\pi i \, \mathrm{Tr}_{\mathbb{C}/\mathbb{R}}(a_v x)}$; we say a Schwarz function $\Psi_v$ is standard if it is the product
of a polynomial and $e^{-2\pi |a_v|_v^{1/2} (|x|_v + |y|_v)}$. The significance of this normalization
is twofold: a standard function is automatically $K_v$-finite and also the class of
standard functions is self-dual under the Fourier transform corresponding to the
character $e_{F_v}$.

If $V$ is a real vector space, then by a Schwarz norm on the space of Schwarz
functions on $V$, we shall mean a norm $S$ of the form

(10.1)  $S(\Psi) = \sup_{\mathcal{D}} \sup_{x} \left| (1 + \|x\|)^M \mathcal{D}\Psi \right|,$

for some finite collection of constant-coefficient differential operators $\mathcal{D}$ on $V$ and
some norm $\|x\|$ on $V$.
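Put, for $g \in \mathrm{GL}_2(\mathbb{A}_F)$,

$f_\Psi(s, g) = |\det(g)|^s \int_{\mathbb{A}_F^\times} \Psi((0, t)g) |t|^{2s} \, d^\times t.$

The integral converges absolutely for $\Re(s) > 1/2$ and extends to a meromorphic
function of $s$ with possible poles at most at $s = 0, 1/2$. Moreover, for all $s$,

(10.2)  $f_\Psi\!\left( s, \begin{pmatrix} a & x \\ 0 & b \end{pmatrix} g \right) = |a/b|^s f_\Psi(s, g).$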

Put $E_\Psi(s, g) = \sum_{\gamma \in B(F)\backslash\mathrm{GL}_2(F)} f_\Psi(s, \gamma g)$. This converges when $\Re(s) > 1$, extends
to a meromorphic function of $s$ with simple poles at $s = 0, 1$ and satisfies the
functional equation

$E_\Psi(s, g) = E_{\hat{\Psi}}(1 - s, g),$

where $\hat{\Psi}$ is the Fourier transform

(10.3)  $\hat{\Psi}(x_1, y_1) = \int_{\mathbb{A}_F^2} \Psi(x, y) \, e_F(x_1 y - y_1 x) \, dx \, dy.$

Moreover, the residue at $s = 1$ is the constant function with value $c_1 \int_{\mathbb{A}_F^2} \Psi(x, y) \, dx \, dy$,
and the residue at $s = 0$ is the constant function with value $c_2 \Psi(0)$, where $c_1, c_2$ are
constants (depending only on the choice of measure). Finally, for any fixed $g$ the
function $s \mapsto s(1 - s) E_\Psi(s, g)$ decays rapidly in vertical strips, i.e. $(1 + |s|)^N |s(1 -
s) E_\Psi(s, g)|$ is bounded in any strip $A \le \Re(s) \le B$. The proof of all these properties
follows from "Poisson summation" for $F^2 \subset \mathbb{A}_F^2$, and we omit them.

Moreover, the association $\Psi \mapsto E_\Psi$ is twisted-equivariant for the natural $\mathrm{GL}_2(\mathbb{A}_F)$-
action on the space of Schwarz functions and on $C^\infty(X)$: that is to say,

(10.4)  $E_{h \cdot \Psi}(s, g) = |\det(h)|^{-s} (h \cdot E_\Psi(s, g)),$

where $h \cdot$ denotes right translation by $h$.

We give an example with $F = \mathbb{Q}$ (cf. (3.29)]).
Example 10.1. Suppose $F = \mathbb{Q}$, $\Psi = \prod_v \Psi_v$ where, for each finite $v$, $\Psi_v$ is the
characteristic function of the maximal compact of $F_v^2$, and $\Psi_\infty(x, y) = e^{-\pi(x^2 + y^2)}$.
Then $E_\Psi(s, g)$ is determined by its restriction to $\mathrm{SL}_2(\mathbb{R})$.

Moreover, $E_\Psi(s, g)$ descends from a function of $g \in \mathrm{SL}_2(\mathbb{R})$ to a function $E^*(s, z)$
on $\mathbb{H} = \mathrm{SL}_2(\mathbb{R})/\mathrm{SO}_2$, where the identification is $g \mapsto g \cdot i$. In fact,

(10.5)  $E^*(s, z) = \pi^{-s} \Gamma(s) \zeta(2s) \sum_{[c:d] \in \mathbb{P}^1(\mathbb{Q})} \frac{y^s}{|cz + d|^{2s}}.$

If we put $\xi(s) = \pi^{-s/2} \Gamma(s/2) \zeta(s)$, then $E^*(s, z)$ has the Fourier expansion

(10.6)  $E^*(s, z) = \xi(2s) y^s + \xi(2 - 2s) y^{1-s} + 4\sqrt{y} \sum_{n \in \mathbb{N}} K_{s-1/2}(2\pi n y) \cos(2\pi n x) \sum_{ab = n} \left( \frac{a}{b} \right)^{s - 1/2}.$

It satisfies the functional equation $E^*(s, z) = E^*(1 - s, z)$. Moreover it is a meromorphic
function of $s$ with poles precisely at $s = 0$ and $s = 1$. In both cases the
residue is the constant function.
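
The functional equation can be read off term by term from the Fourier expansion. The check below is a sketch that assumes the expansion has the shape indicated in (10.6), with constant term $\xi(2s) y^s + \xi(2 - 2s) y^{1-s}$, Bessel factor $K_{s-1/2}$, and divisor sum $\sum_{ab = n} (a/b)^{s - 1/2}$; it uses only the standard symmetry $K_\nu = K_{-\nu}$.

```latex
% Constant term: s \mapsto 1-s simply swaps the two summands,
\xi(2s)\, y^{s} + \xi(2 - 2s)\, y^{1-s}
   \;\longmapsto\;
\xi(2 - 2s)\, y^{1-s} + \xi(2s)\, y^{s}.
% Bessel factor: K_{(1-s) - 1/2} = K_{-(s - 1/2)} = K_{s - 1/2}.
% Divisor sum: exchange the roles of a and b,
\sum_{ab = n} \Big( \frac{a}{b} \Big)^{(1-s) - 1/2}
   = \sum_{ab = n} \Big( \frac{b}{a} \Big)^{s - 1/2}
   = \sum_{ab = n} \Big( \frac{a}{b} \Big)^{s - 1/2}.
```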

Motivated by this example, the reader may find it helpful to keep in mind the "dictionary": $f_\Psi(s,g)$ corresponds to $\pi^{-s}\Gamma(s)\zeta(2s)\,y^s = \xi(2s)\,y^s$, and $E_\Psi(s,g)$ to $E^*(s,z)$ as defined in (10.5).

Remark 10.1. Suppose $\Psi$ is invariant by $K_\infty \times K_{\max}$. Then $f_\Psi$ is a multiple of $g \mapsto \mathrm{ht}(g)^s$, as follows from the uniqueness of spherical functions satisfying (10.2). Thus, for $\Re(s) > 1$, $E_\Psi(s,g) = c(s) \sum_{\gamma \in B(F)\backslash \mathrm{GL}_2(F)} \mathrm{ht}(\gamma g)^s$.

We now proceed to establish the "standard" properties of the Eisenstein series $E_\Psi$. It is convenient to first recall an explicit bound for archimedean Mellin transforms; the first part is Tate's thesis, and the second will only be needed much later.

Lemma 10.1. Let $v$ be archimedean and let $\Psi_v$ be a Schwarz function on $F_v$. The integral $G(s) := \int_{x \in F_v^\times} \Psi_v(x)|x|^s\,d^\times x$ extends to a meromorphic function and:

(1) $\frac{G(s)}{\zeta_{F,v}(s)}$ is holomorphic, where $\zeta_{F,v}(s)$ is the local factor of the Dedekind $\zeta$-function of $F$ at $v$.

(2) For any $N > 0$, the function $G_N(s) := \prod_{i=0}^{N}(s+i)\,G(s)$ is holomorphic in $\Re(s) > -N$, and the absolute value of $(1+|s|)^M G_N(s)$ in any strip $-N < \Re(s) < A$ is bounded by some Schwarz norm (depending on $A, N, M$; see (10.1) for the definition) of $\Psi_v$.

Proof. The first assertion is Tate's thesis, and we leave the second to the reader (if 
any). □ 
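As a concrete instance of the first assertion, take $F_v = \mathbb{R}$ and the Gaussian $\Psi_v(x) = e^{-\pi x^2}$; then $G(s) = \pi^{-s/2}\Gamma(s/2)$, which is exactly the real local factor, so the quotient is the constant $1$. The sketch below checks this numerically for a few real values of $s$. It is an illustration under stated assumptions (the measure $d^\times x = dx/|x|$, a truncated Simpson rule, and the hypothetical helper names `local_zeta_gaussian` and `gamma_factor`), not part of the paper's proof.

```python
import math


def local_zeta_gaussian(s: float, upper: float = 8.0, steps: int = 20000) -> float:
    """Approximate G(s) = integral over R^x of exp(-pi x^2) |x|^s dx/|x| for real s >= 1.

    By symmetry this equals 2 * integral_0^upper exp(-pi x^2) x^(s-1) dx; the tail
    beyond `upper` is negligible. Composite Simpson's rule with an even step count.
    """
    if steps % 2:
        steps += 1
    h = upper / steps

    def f(x: float) -> float:
        return math.exp(-math.pi * x * x) * x ** (s - 1)

    total = f(0.0) + f(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(i * h)
    return 2 * total * h / 3


def gamma_factor(s: float) -> float:
    """Real archimedean local factor pi^(-s/2) * Gamma(s/2)."""
    return math.pi ** (-s / 2) * math.gamma(s / 2)


if __name__ == "__main__":
    for s in (1.0, 2.0, 3.0, 4.0):
        print(s, local_zeta_gaussian(s), gamma_factor(s))
```

For a general Schwarz function the quotient is no longer constant, but it remains entire, which is the content of assertion (1).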

Lemma 10.2. The function $s \mapsto s(1/2-s)f_\Psi(s,g)$ extends to a holomorphic function of $s$. It decays rapidly along vertical lines:

(10.7) $\left|(1+|\Im(s)|)^N\, s(1/2-s)\, f_\Psi(s,g)\right| \ll_{N,\Psi} \mathrm{ht}(g)^{\Re(s)},$

where the implicit constant is uniform for $\Re(s)$ in a compact set.

Proof. By (10.2) and the Iwasawa decomposition, it will suffice to prove the assertions in the special case $g \in K_\infty \times K_{\max}$. So we write $g = k \in K_\infty \times K_{\max}$ and denote by $k_v$ the component of $k$ in $\mathrm{PGL}_2(F_v)$. Moreover, without loss of generality, we may assume $\Psi$ is a product of Schwarz functions at each place, i.e. $\Psi = \prod_v \Psi_v$. Then

$f_\Psi(s,g) = \prod_{v\ \mathrm{infinite}} \int_{F_v^\times} \Psi_v((0,t)k_v)|t|_v^{2s}\,d^\times t \;\cdot \prod_{v\ \mathrm{finite}} \int_{F_v^\times} \Psi_v((0,t)k_v)|t|_v^{2s}\,d^\times t.$

By Tate's thesis, it follows that the product over finite places is of the form $\zeta_F(2s)h(s)$, where $\zeta_F(\cdot)$ is the (finite part of the) Dedekind $\zeta$-function of the number field $F$ and $h(s)$ is a holomorphic function with at most polynomial growth in vertical strips (indeed, a polynomial in $q^{\pm s}$ for various $q$). All the assertions of the Lemma now follow from Lem. 10.1 and standard facts about the analytic properties of $\zeta_F$.

In fact, if the $\Psi_v$ for $v$ finite are regarded as fixed, then the implicit constant in (10.7) is bounded by an appropriate Schwarz norm, depending on $N$ and the compact set to which $\Re(s)$ is constrained, of $\prod_{v\ \mathrm{infinite}} \Psi_v$. This follows from the second assertion of Lem. 10.1. □

Lemma 10.3. The constant term $E_\Psi^N(s,g) := \int_{x \in F\backslash\mathbb{A}_F} E_\Psi(s,n(x)g)\,dx$ equals $f_\Psi(s,g) + f_{\widehat{\Psi}}(1-s,g)$.

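Proof. (Sketch). A double coset decomposition shows that, for $\Re(s) \gg 1$, $E_\Psi^N(s,g) = f_\Psi(s,g) + \int_{n \in N(\mathbb{A}_F)} f_\Psi(s,wng)\,dn$. So it will suffice to show that $\int_{n \in N(\mathbb{A}_F)} f_\Psi(s,wng)\,dn = f_{\widehat{\Psi}}(1-s,g)$. The left-hand side may be expressed as

(10.8) $|\det(g)|^s \int_{t \in \mathbb{A}_F^\times} \int_{x \in \mathbb{A}_F} \Psi((t,tx)g)|t|^{2s}\,d^\times t\,dx = |\det(g)|^s \int_{t \in \mathbb{A}_F^\times/F^\times} \int_{x \in \mathbb{A}_F} \sum_{\delta \in F^\times} \Psi((\delta t,\delta t x)g)|t|^{2s}\,d^\times t\,dx.$

For any Schwarz function $\Psi$ on $\mathbb{A}_F^2$, one has $\sum_{\alpha \in F} \int_{y \in \mathbb{A}_F} \Psi(\alpha,y)\,dy = \sum_{\beta \in F} \widehat{\Psi}(0,\beta)$. The result follows from routine manipulation and use of Tate's functional equation. □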

We set

(10.9) $\overline{E}_\Psi(s,g) = E_\Psi(s,g) - f_\Psi(s,g) - f_{\widehat{\Psi}}(1-s,g),$

so $\overline{E}_\Psi$ defines a function on $B(F)\backslash\mathrm{PGL}_2(\mathbb{A}_F)$. It is a "truncated" Eisenstein series where we have removed the constant term. Moreover, $\overline{E}_\Psi(s,g)$ is holomorphic in $s$ (this follows, for example, by computing residues at each of the points $s = 0, 1/2, 1$ and seeing they are all zero). By definition, for $g \in \mathrm{GL}_2(\mathbb{A}_F)$ we have an equality

(10.10) $E_\Psi(s,g) = \overline{E}_\Psi(s,g) + f_\Psi(s,g) + f_{\widehat{\Psi}}(1-s,g).$

Lemma 10.4. Let $T, N > 0$ and let $\Re(s)$ lie in a fixed compact subset of $\mathbb{R}$. Then

(10.11) $(1+|s|)^4\, \overline{E}_\Psi(s,g) \ll_{\Psi,T,N} \mathrm{ht}(g)^{-N}$

for $g \in \mathfrak{S}(T)$. In particular, if $\Omega \subset X$ is compact, then $s(1-s)E_\Psi(s,g)$ is uniformly bounded in $|\Re(s)| \le 2$, $g \in \Omega$.

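Proof. We first claim that, for $t \in \mathbb{R}$, we have $|(1+t^4)\overline{E}_\Psi(N+1+it,g)| \ll_\Psi \mathrm{ht}(g)^{-N+\epsilon}$. Indeed, by definition,

$\overline{E}_\Psi(s,g) = \sum_{\gamma \in B(F)\backslash\mathrm{PGL}_2(F),\ \gamma \notin B(F)} f_\Psi(s,\gamma g) \;-\; f_{\widehat{\Psi}}(1-s,g).$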






In view of Lem. 10.2 it will suffice to show that

(10.12) $\sum_{\gamma \in B(F)\backslash\mathrm{PGL}_2(F),\ \gamma \notin B(F)} \mathrm{ht}(\gamma g)^{\sigma} \ll_\epsilon \mathrm{ht}(g)^{1-\sigma+\epsilon},$

which follows from (|8.10|) and JHTHJ. 

Now (10.11) follows at once from the functional equation $\overline{E}_\Psi(s,g) = \overline{E}_{\widehat{\Psi}}(1-s,g)$, the maximal modulus principle in the strip $|\Re(s)| \le N+1$, and the previous Lemma. 21

The second assertion (involving $\Omega$) follows from (10.7) and (10.11). □

We now compute the Fourier coefficients of the Eisenstein series in general. Recall that $e_F$ is a fixed additive character of $\mathbb{A}_F/F$.

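Lemma 10.5. Set $W_\Psi(s,g) = \int_{x \in F\backslash\mathbb{A}_F} E_\Psi(s,n(x)g)\,e_F(x)\,dx$. Then, for $\Re(s) > 1$,

(10.13) $W_\Psi(a(y)) = |y|^{1-s} \int_{t \in \mathbb{A}_F^\times,\ x \in \mathbb{A}_F} \Psi(t,tx)\,e_F(xy)\,|t|^{2s}\,dx\,d^\times t.$

In particular, if $\Psi = \otimes_v \Psi_v$, then $W_\Psi = \prod_v W_{\Psi_v}$, where for $\Re(s) > 1$,

$W_{\Psi_v}(a(y)) = |y|_v^{1-s} \int_{t \in F_v^\times,\ x \in F_v} \Psi_v(t,tx)\,e_{F_v}(xy)\,|t|_v^{2s}\,dx\,d^\times t,$

for $y \in F_v$. Finally, if $\Psi_v(x,y) = \varphi_1(x)\varphi_2(y)$, $\omega_v$ a character of $F_v^\times$, and $\Re(s') - |\Re(s)| \gg 1$,

(10.14) $\int_{y \in F_v^\times} W_{\Psi_v}(a(y))\,|y|^{s'}\omega_v(y)\,d^\times y = \int \varphi_1(y)\,|y|^{s'+s}\omega_v(y)\,d^\times y \int \widehat{\varphi}_2(y)\,|y|^{1+s'-s}\omega_v(y)\,d^\times y,$

where $\widehat{\varphi}_2$ is the Fourier transform, defined by $\widehat{\varphi}_2(y) = \int_{F_v} \varphi_2(t)\,e_{F_v}(yt)\,dt$.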
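Proof. By the Bruhat decomposition,

(10.15) $W_\Psi(s,g) = \int_{F\backslash\mathbb{A}_F} e_F(x) \sum_{\gamma \in B(F)\backslash\mathrm{PGL}_2(F)} f_\Psi(\gamma n(x)g)\,dx = \int_{\mathbb{A}_F} f_\Psi(wn(x)g)\,e_F(x)\,dx.$

Thus

(10.16) $W_\Psi(s,g) = |\det(g)|^s \int_{x \in \mathbb{A}_F} \int_{t \in \mathbb{A}_F^\times} \Psi((t,0)n(x)g)\,d^\times t\,|t|^{2s}\,e_F(x)\,dx = |\det(g)|^s \int_{t \in \mathbb{A}_F^\times,\ x \in \mathbb{A}_F} \Psi((t,tx)g)\,|t|^{2s}\,e_F(x)\,dx\,d^\times t.$

The claimed conclusion follows upon substituting $g = a(y)$, together with some routine computations. □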



21 To apply the maximal modulus principle in this context, one needs some a priori decay of $\overline{E}_\Psi$, which follows easily from the corresponding properties of $E_\Psi$ and $f_\Psi$.
Remark 10.2. Remark that $W_\Psi(s,g)$ belongs to the Whittaker model of a certain induced representation of $\mathrm{PGL}_2(\mathbb{A}_F)$, namely that representation $\pi(s)$ induced from the character $a(y) \mapsto |y|^{s-1/2}$ of the maximal torus (unitary induction, so $\pi(s)$ is tempered for $\Re(s) = 1/2$). This representation is the tensor product of local representations $\pi_v(s)$, analogously defined; these local representations are irreducible and generic for all $s$.

Thus (10.13) determines $W_\Psi$ uniquely (the theory of the Kirillov model). Similarly the condition $W_{\Psi_v,s}(1) = 1$ uniquely determines the (spherical) vector $W_{\Psi_v,s}$.

We finally remark that $W_{\Psi_v,s}$, as $\Psi_v$ ranges over all Schwarz-Bruhat functions on $F_v^2$ if $v$ is nonarchimedean, or over all standard functions if $v$ is archimedean, exhausts the Whittaker model of $\pi(s)$. Indeed, the set of such functions $W_{\Psi_v,s}$ is a subspace of the Whittaker model of $\pi(s)$ that is stable under the action of the Hecke algebra of $\mathrm{PGL}_2(F_v)$; this action is irreducible, whence the result.

We recall that $\mathfrak{d}$ denotes the different (Sec. 2.3) and we denote by $\zeta_{F,v}(s)$ or simply $\zeta_v(s)$ the local factor of the Dedekind $\zeta$-function of $F$ at the place $v$.

Corollary 10.1. Suppose $v$ is nonarchimedean, and $\Psi_v$ the characteristic function of $\mathfrak{o}_v^2$. Then $W_v(a(y))$ satisfies

(10.17) $\int W_v(a(y))\,|y|^{s'}\,d^\times y = q_v^{d_v(1+s'-s)}\,\zeta_v(s+s')\,\zeta_v(1-s+s'),$

with $d_v = v(\mathfrak{d})$. Note that this specifies $W_v$, because it is $K_v$-invariant.

In particular, for each finite $v$ with $v(\mathfrak{d}) = 0$, the function $W_v(g)$ is the unique spherical Whittaker function on $\mathrm{GL}_2(F_v)$ with Hecke eigenvalue $q_v^s + q_v^{1-s}$, and with $W_v(1) = 1$.
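In the unramified case $d_v = 0$, the identity (10.17) can be tested numerically from the standard explicit formula for the spherical Whittaker function on the torus. The sketch below is illustrative only and rests on assumptions that are not spelled out in the text: the Satake parameters are taken to be $\alpha = q^{s-1/2}$, $\beta = q^{1/2-s}$ (so that $q^{1/2}(\alpha+\beta) = q^s + q^{1-s}$), the value at $a(\varpi^m)$ is $q^{-m/2}\sum_{i+j=m}\alpha^i\beta^j$ for $m \ge 0$ and $0$ for $m < 0$, and the multiplicative measure gives the units volume $1$. All function names are hypothetical.

```python
def spherical_whittaker(m: int, q: float, s: complex) -> complex:
    """Assumed value W(a(pi^m)) of the normalized spherical Whittaker function."""
    if m < 0:
        return 0.0
    alpha = q ** (s - 0.5)
    beta = q ** (0.5 - s)
    return q ** (-m / 2) * sum(alpha ** i * beta ** (m - i) for i in range(m + 1))


def mellin_partial(q: float, s: complex, sp: complex, terms: int = 200) -> complex:
    """Truncation of the integral of W(a(y)) |y|^sp d^x y, i.e. sum over m of W(a(pi^m)) q^(-m sp)."""
    return sum(spherical_whittaker(m, q, s) * q ** (-m * sp) for m in range(terms))


def local_zeta(q: float, w: complex) -> complex:
    """Local zeta factor (1 - q^(-w))^(-1)."""
    return 1 / (1 - q ** (-w))


def mellin_closed_form(q: float, s: complex, sp: complex) -> complex:
    """Right-hand side of (10.17) with d_v = 0."""
    return local_zeta(q, s + sp) * local_zeta(q, 1 - s + sp)


if __name__ == "__main__":
    q, s, sp = 3, 0.5 + 0.7j, 1.0
    print(mellin_partial(q, s, sp), mellin_closed_form(q, s, sp))
```

The agreement is just the geometric-series identity $\sum_m x^m \sum_{i+j=m}\alpha^i\beta^j = (1-\alpha x)^{-1}(1-\beta x)^{-1}$ with $x = q^{-1/2-s'}$.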

As is evident from (10.6), the Eisenstein series themselves are not bounded. They belong to $L^{2-\epsilon}$, but not $L^2$. To avoid some difficulties with growth, we shall use wave-packets of Eisenstein series. We now turn to their analysis.

10.2. Regularization of Eisenstein series on PGL2. Our aim in this section is to show that an appropriate "wave packet" of the Eisenstein series $E_\Psi(g,s)$ constructed in the previous section lies in $L^\infty$.

Note that, in Example 10.1 above, $E^*(s,z)$ differs from the usual unitary Eisenstein series by a factor $\xi(2s)$. This factor ensures that $E^*(s,z)$ is holomorphic, but this causes an inconvenience at $s = 1/2$, which will manifest itself in our construction of bounded wave-packets. Recall that this pole can be interpreted rather naturally: see footnote on p. 38.

Let $\kappa > 0$, and let $\mathcal{H}(\kappa)$ be the family of functions holomorphic in an open neighbourhood of the strip $-\kappa \le \Re(s) \le 1+\kappa$, with rapid polynomial decay in vertical strips (i.e. $\sup_{t \in \mathbb{R}} (1+|t|)^N |h(\sigma+it)|$ is bounded, for each $N$, by a continuous function of $\sigma$) and satisfying $h(0) = h(\tfrac12) = h(1) = 0$. For each $N \in \mathbb{Z}$ we have a norm $\|\cdot\|_N$ on $\mathcal{H}(\kappa)$ defined via:

(10.18) $\|h\|_N := \int_{-\infty}^{\infty} \left(|h(1+\kappa+it)| + |h(-\kappa+it)|\right)(1+|t|)^N\,dt.$

Lemma 10.6. Let $h \in \mathcal{H}(\kappa)$, and set $E_{h,\Psi}(g) = \int_{\Re(s) = 1+\kappa} h(s)E_\Psi(g,s)\,ds$. Then:

$\|E_{h,\Psi}(g)\|_{L^\infty} \ll_{\Psi,\kappa} \|h\|_0.$






Proof. In the notation of (10.10),

(10.19) $E_{h,\Psi}(g) = \int_{\Re(s) = 1+\kappa} \overline{E}_\Psi(s,g)h(s)\,ds + \int_{\Re(s) = 1+\kappa} h(s)f_\Psi(s,g)\,ds + \int_{\Re(s) = 1+\kappa} h(s)f_{\widehat{\Psi}}(1-s,g)\,ds.$

Fix $T > 0$ so that $\mathfrak{S}(T)$ surjects onto $X$ (see the definitions given earlier). We will bound each term on the right-hand side of the above equation for $g \in \mathfrak{S}(T)$.

By Lem. 10.4 the first term on the right-hand side is $O_{\Psi,\kappa}(\|h\|_0)$. By Lem. 10.2 the function $f_{\widehat{\Psi}}(1-s,g)$ is uniformly bounded above in the region $\Re(s) = 1+\kappa$, $g \in \mathfrak{S}(T)$; thus the third term on the right-hand side is also $O_{\Psi,\kappa}(\|h\|_0)$.

As for the second term, we shift contours to the line $\Re(s) = -\kappa$. The shift of contours is justified by the rapid decay of $h(s)$ along vertical lines and Lem. 10.2. Moreover, since $h(0) = h(1/2) = 0$, the function $s \mapsto h(s)f_\Psi(s,g)$ has no poles in between the contours.

Applying Lem. 10.2 one more time to control the contour integral along $\Re(s) = -\kappa$, we conclude. □

Remark 10.3. Suppose $\Psi = \prod_v \Psi_v$ and the $\Psi_v$ are regarded as fixed for $v$ finite. Put $\Psi_f = \prod_{v\ \mathrm{finite}} \Psi_v$, a Schwarz function on $\mathbb{A}_{F,f}^2$, and $\Psi_\infty = \prod_{v\ \mathrm{infinite}} \Psi_v$. Then the above argument gives the slightly more explicit bound

(10.20) $\|E_{h,\Psi}\|_{L^\infty} \ll \|h\|_0\, S(\Psi_\infty)$

where $S$ is a Schwarz norm on $F_\infty^2$. This follows by explicating the above argument, taking into account the last sentence of the proof of Lem. 10.2. Indeed, one obtains even the corresponding bound for Sobolev norms, namely

(10.21) $S_{\infty,d,\beta}(E_{h,\Psi}) \ll \|h\|_0\, S(\Psi_\infty)$

for an appropriate Schwarz norm of $\Psi_\infty$. One deduces this from (10.20) upon noting that, if $\mathcal{D}$ belongs to the universal enveloping algebra of $\mathrm{SL}_2(F_\infty)$, then, by (10.4), $\mathcal{D}E_\Psi(s,g) = E_{\mathcal{D}\Psi}(s,g)$, so also $\mathcal{D}E_{h,\Psi} = E_{h,\mathcal{D}\Psi}$. It is then easy to check that a Schwarz norm of $\mathcal{D}\Psi_\infty$ is bounded by a Schwarz norm of $\Psi_\infty$.

10.3. Regularization of Eisenstein series on PGL2 x PGL2. In this section we carry out the analogue of Lem. 10.6 in the context of PGL2 x PGL2 (this amounts to regularizing the rank 2 Eisenstein series on PGL2 x PGL2).

To ease the reader's path, we briefly mention what the point of this section is in classical notation: Suppose $h(s_1,s_2)$ is holomorphic in two variables inside the square $|\Re(s_1)| + |\Re(s_2)| \le 1/2 + \kappa$, and, moreover, $h(s_1,s_2)$ has zeroes along the six planes defined by any of the linear constraints $s_1 = 0$, $s_1 = 1/2$, $s_1 = -1/2$, $s_2 = 0$, $s_2 = -1/2$, $s_2 = 1/2$.

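Define the wave-packet $E_h(z_1,z_2)$ on $\mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H} \times \mathrm{SL}_2(\mathbb{Z})\backslash\mathbb{H}$ via

$E_h(z_1,z_2) = \int_{t,t' \in \mathbb{R}} h(it,it')\,E^*(1/2+it,z_1)\,E^*(1/2+it',z_2)\,dt\,dt'.$

Here $E^*$ is as in Example 10.1. We shall show - under mild decay conditions on $h$ - that $E_h(z_1,z_2)$ is majorized, on the product of two fundamental regions, by $A(y_1,y_2) := \frac{\sqrt{y_1 y_2}}{y_1^{1/2+\kappa} + y_2^{1/2+\kappa}}$. Since $\int A(y_1,y_2)^4\,\frac{dy_1\,dy_2}{y_1^2 y_2^2}$ is finite, $E_h$ lies in $L^4$, and even in $L^{4+\epsilon}$ for $\epsilon$ small.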






As the reader may verify at this point, the majorization is little more than an 
exercise in complex integration, using the fact that the large contribution to the 
Eisenstein series comes from the constant term. 

We will need to repeatedly shift contours in the setting of a function of two 
complex variables. To clarify matters, we state the following Lemma, which we will 
use repeatedly without explicitly invoking it. 

Lemma 10.7. Suppose $U \subset \mathbb{R}^2$ is an open domain and $f(z_1,z_2)$ a holomorphic function on the complex domain $\{(z_1,z_2) \in \mathbb{C}^2 : (\Re(z_1),\Re(z_2)) \in U\}$. Suppose moreover that there is a continuous function $M : U \to \mathbb{R}$ such that

(10.22) $\sup_{(t_1,t_2) \in \mathbb{R}^2} |f(\sigma_1+it_1,\sigma_2+it_2)|\,(1+|t_1|+|t_2|)^3 \le M(\sigma_1,\sigma_2).$

Then the function

(10.23) $(\sigma_1,\sigma_2) \mapsto \int_{\Re(z_1) = \sigma_1,\ \Re(z_2) = \sigma_2} f(z_1,z_2)\,dz_1\,dz_2$

is locally constant on $U$.

We omit the easy proof.
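For the reader who wants the omitted step spelled out, one possible route is sketched below. This is a hedged reconstruction rather than the author's argument: it moves one variable at a time, applies Cauchy's theorem on a truncated rectangle, and uses the decay hypothesis (10.22) to discard the horizontal sides. The symbols $J$, $T$ and $C$ are introduced here for illustration only.

```latex
% Sketch, assuming the segment [\sigma_1,\sigma_1'] \times \{\sigma_2\} lies in U.
% Write J(\sigma_1,\sigma_2) for the integral in (10.23); it converges absolutely because
% (1+|t_1|+|t_2|)^{-3} is integrable over \mathbb{R}^2.
\[
  C := \sup_{\sigma_1 \le \sigma \le \sigma_1'} M(\sigma,\sigma_2) < \infty
  \qquad\text{(continuity of $M$ on a compact segment).}
\]
% Cauchy's theorem in z_1 on the rectangle [\sigma_1,\sigma_1'] \times [-T,T], for fixed t_2:
\[
  \left| \int_{-T}^{T} f(\sigma_1' + it_1, z_2)\,dt_1 - \int_{-T}^{T} f(\sigma_1 + it_1, z_2)\,dt_1 \right|
  \le 2(\sigma_1'-\sigma_1)\, C\, (1+T+|t_2|)^{-3},
  \qquad z_2 = \sigma_2 + i t_2 .
\]
% Integrating over t_2 and letting T tend to infinity:
\[
  \int_{\mathbb{R}} (1+T+|t_2|)^{-3}\,dt_2 = (1+T)^{-2} \longrightarrow 0
  \quad\Longrightarrow\quad
  J(\sigma_1',\sigma_2) = J(\sigma_1,\sigma_2).
\]
% The same argument applies in the second variable. Since U is open, any two points of a
% small box inside U are joined by an axis-parallel path in that box, so J is locally constant.
```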

We will now introduce a family of normed spaces $\mathcal{H}^{(2)}(\kappa)$. In fact, the spaces themselves are independent of $\kappa$, but the norm depends on $\kappa$. These are spaces of holomorphic functions in two variables $z_1, z_2$; and the norm, roughly speaking, controls the behavior of $h$ when the real parts of $(z_1,z_2)$ lie in the square $|\Re(z_1)| + |\Re(z_2)| \le 1/2 + \kappa$.

Definition 10.1. Let $0 < \kappa < 1$. Let $\mathcal{H}^{(2)}(\kappa)$ be the family of functions $h(z_1,z_2)$ in two complex variables, holomorphic in a neighbourhood of $(0,0)$, and satisfying:

(1) Write $h' = \frac{h}{z_1 z_2 (1/4 - z_1^2)(1/4 - z_2^2)}$. Then $h'$, originally a meromorphic function in a neighbourhood of $0$, extends to a holomorphic function in the strip $\{z_1 : |\Re(z_1)| \le 2\} \times \{z_2 : |\Re(z_2)| \le 2\}$.

(2) Growth condition: for every $N > 0$,

$\sup_{(\sigma,\sigma') \in [-2,2]^2}\ \sup_{(t,t') \in \mathbb{R}^2} (1+|t|+|t'|)^N\, |h(\sigma+it,\sigma'+it')| < \infty.$

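For each $N \in \mathbb{Z}$ we introduce a norm on $\mathcal{H}^{(2)}(\kappa)$ via:

(10.24) $\|h\|_N := \sum_{\epsilon_1,\epsilon_2 \in \{\pm 1\}} \int_{t,t' \in \mathbb{R}} (1+|t|+|t'|)^N \left(|h'(\epsilon_1(1/2+\kappa)+it,\,it')| + |h'(it,\,\epsilon_2(1/2+\kappa)+it')|\right) dt\,dt'.$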
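Lemma 10.8. For $h \in \mathcal{H}^{(2)}(\kappa)$, put

$E_{h,\Psi_1,\Psi_2}(g_1,g_2) = \int_{\Re(t) = 0} \int_{\Re(t') = 0} h(t,t')\,E_{\Psi_1}(g_1,1/2+t)\,E_{\Psi_2}(g_2,1/2+t')\,dt\,dt'.$

Then

$E_{h,\Psi_1,\Psi_2}(x_1,x_2) \ll \|h\|_0\, \frac{\mathrm{ht}(x_1)^{1/2}\,\mathrm{ht}(x_2)^{1/2}}{\mathrm{ht}(x_1)^{1/2+\kappa} + \mathrm{ht}(x_2)^{1/2+\kappa}}.$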

Proof. We may assume that $\|h\|_0 = 1$. Let notations be as established prior to Lem. 10.4. We will proceed as in Lem. 10.6, expanding via (10.10). It will suffice to give an upper bound, in absolute value, for each of:

(10.25) $I_0(g_1,g_2) = \int_{t,t'} h(t,t')\,\overline{E}_{\Psi_1}(g_1,1/2+t)\,\overline{E}_{\Psi_2}(g_2,1/2+t')\,dt\,dt'$

(10.26) $I_1(g_1,g_2) = \int_{t,t'} h(t,t')\,\overline{E}_{\Psi_1}(g_1,1/2+t)\,f_{\Psi_2}(g_2,1/2\pm t')\,dt\,dt'$

(10.27) $I_2(g_1,g_2) = \int_{t,t'} h(t,t')\,f_{\Psi_1}(g_1,1/2\pm t)\,\overline{E}_{\Psi_2}(g_2,1/2+t')\,dt\,dt'$

(10.28) $I_3(g_1,g_2) = \int_{t,t'} h(t,t')\,f_{\Psi_1}(g_1,1/2\pm t)\,f_{\Psi_2}(g_2,1/2\pm t')\,dt\,dt',$

whenever $\Psi_1, \Psi_2$ are Schwarz functions on $\mathbb{A}_F^2$, and in each case the contour of integration is the surface $\Re(t) = \Re(t') = 0$. Moreover, in view of condition (1) in Def. 10.1 each integrand extends to a holomorphic function of $(t,t')$ in the region $|\Re(t)| \le 1/2+\kappa$, $|\Re(t')| \le 1/2+\kappa$.

The bound $|I_0| \ll \mathrm{ht}(g_1)^{-N}\,\mathrm{ht}(g_2)^{-N}$ follows from Lem. 10.4, whereas the bounds $|I_1| \ll \mathrm{ht}(g_1)^{-N}\,\mathrm{ht}(g_2)^{-\kappa}$ and $|I_2| \ll \mathrm{ht}(g_2)^{-N}\,\mathrm{ht}(g_1)^{-\kappa}$ follow from moving the $t'$ (in the case of $I_1$) integral to the contour $\Re(t') = \pm(1/2+\kappa)$, applying Lem. 10.2 and Lem. 10.4.

We now turn to $I_3$. We shall consider the case where both signs are $+$, the other cases being similar with appropriate interchanges of sign. Thus set

$Z(t,t') = h(t,t')\,f_{\Psi_1}(g_1,1/2+t)\,f_{\Psi_2}(g_2,1/2+t') = h'(t,t')\; t(1/4-t^2)\,f_{\Psi_1}(g_1,1/2+t)\; t'(1/4-t'^2)\,f_{\Psi_2}(g_2,1/2+t').$

In view of Lem. 10.2 the function $Z(t,t')$ satisfies the conditions for $f$ in Lem. 10.7. We apply Lem. 10.7 to shift the contour to $\Re(t) = -1/2-\kappa$, $\Re(t') = 0$. Now Lem. 10.2 implies that $\left|\int_{\Re(t) = -1/2-\kappa,\ \Re(t') = 0} Z(t,t')\right| \ll \mathrm{ht}(g_2)^{1/2}\,\mathrm{ht}(g_1)^{-\kappa}$. A similar bound holds with $(g_1,g_2)$ interchanged, so in fact we have the stronger bound $\left|\int Z(t,t')\right| \ll \min\left(\mathrm{ht}(g_2)^{1/2}\,\mathrm{ht}(g_1)^{-\kappa},\ \mathrm{ht}(g_1)^{1/2}\,\mathrm{ht}(g_2)^{-\kappa}\right)$. This may also be written $\left|\int Z(t,t')\right| \ll \frac{\mathrm{ht}(g_1)^{1/2}\,\mathrm{ht}(g_2)^{1/2}}{\mathrm{ht}(g_1)^{1/2+\kappa} + \mathrm{ht}(g_2)^{1/2+\kappa}}$.

Similar considerations apply to the terms in $I_3$ corresponding to other choices of sign, so we conclude that $|I_3| \ll \frac{\mathrm{ht}(g_1)^{1/2}\,\mathrm{ht}(g_2)^{1/2}}{\mathrm{ht}(g_1)^{1/2+\kappa} + \mathrm{ht}(g_2)^{1/2+\kappa}}$. □
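The passage from the minimum of two bounds to the single quotient is elementary: with $A = \mathrm{ht}(g_1)^{1/2+\kappa}$ and $B = \mathrm{ht}(g_2)^{1/2+\kappa}$, the minimum equals $\sqrt{\mathrm{ht}(g_1)\mathrm{ht}(g_2)}/\max(A,B)$, and $\max(A,B) \le A + B \le 2\max(A,B)$. The following sketch checks on random inputs that the two forms agree up to a factor between 1 and 2. It is only an illustration; the function names and the sampling ranges are arbitrary choices made here.

```python
import random


def min_form(h1: float, h2: float, kappa: float) -> float:
    """min( h2^(1/2) h1^(-kappa), h1^(1/2) h2^(-kappa) )."""
    return min(h2 ** 0.5 * h1 ** (-kappa), h1 ** 0.5 * h2 ** (-kappa))


def quotient_form(h1: float, h2: float, kappa: float) -> float:
    """sqrt(h1 h2) / ( h1^(1/2+kappa) + h2^(1/2+kappa) )."""
    return (h1 * h2) ** 0.5 / (h1 ** (0.5 + kappa) + h2 ** (0.5 + kappa))


def ratio_range(samples: int = 10000, seed: int = 0) -> tuple:
    """Smallest and largest observed value of min_form / quotient_form."""
    rng = random.Random(seed)
    ratios = []
    for _ in range(samples):
        h1 = 10 ** rng.uniform(0, 6)   # heights sampled in [1, 10^6]
        h2 = 10 ** rng.uniform(0, 6)
        kappa = rng.uniform(0.01, 0.99)
        ratios.append(min_form(h1, h2, kappa) / quotient_form(h1, h2, kappa))
    return min(ratios), max(ratios)


if __name__ == "__main__":
    print(ratio_range())
```

Since the ratio is $(A+B)/\max(A,B)$, the two expressions are interchangeable in any bound with an implied constant.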

Lemma 10.9. Let notations be as in the previous Lemma. For any $p < \frac{4}{1-2\kappa}$, any $d, \beta > 0$, there exists $N$ such that

Proof. Indeed, we note that

$\int_{y_1,y_2 \ge 1} \left(\frac{\sqrt{y_1 y_2}}{y_1^{1/2+\kappa} + y_2^{1/2+\kappa}}\right)^{p} \frac{dy_1\,dy_2}{y_1^2\,y_2^2} < \infty$

whenever $p < \frac{4}{1-2\kappa}$. We apply the previous Lemma and reduction theory to conclude. □
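The convergence range for the exponent $p$ can be recovered by a direct computation. The following worked derivation is a sketch under the assumption $0 < \kappa < 1/2$ (for $\kappa \ge 1/2$ the same steps show convergence for every $p$); by symmetry it treats only the region $1 \le y_1 \le y_2$.

```latex
% Step 1: on 1 \le y_1 \le y_2 the denominator is comparable to its larger term.
\[
  y_2^{1/2+\kappa} \le y_1^{1/2+\kappa} + y_2^{1/2+\kappa} \le 2\,y_2^{1/2+\kappa}
  \quad\Longrightarrow\quad
  \left(\frac{\sqrt{y_1 y_2}}{y_1^{1/2+\kappa} + y_2^{1/2+\kappa}}\right)^{p}
  \asymp_p\; y_1^{p/2}\, y_2^{-p\kappa}.
\]
% Step 2: insert the measure dy_1 dy_2 / (y_1^2 y_2^2) and integrate in y_1 first.
\[
  \int_{1}^{\infty} y_2^{-p\kappa-2}
  \left( \int_{1}^{y_2} y_1^{p/2-2}\,dy_1 \right) dy_2 ,
  \qquad
  \int_{1}^{y_2} y_1^{p/2-2}\,dy_1 \;
  \begin{cases}
     \ll 1 + \log y_2, & p \le 2,\\[2pt]
     \asymp_p\; y_2^{p/2-1} \quad (y_2 \ge 2), & p > 2.
  \end{cases}
\]
% Step 3: for p \le 2 the outer integral converges; for p > 2 the integrand is
% comparable to y_2^{p(1/2-\kappa)-3}, so
\[
  \int_{2}^{\infty} y_2^{\,p(1/2-\kappa)-3}\,dy_2 < \infty
  \iff p\left(\tfrac12-\kappa\right) < 2
  \iff p < \frac{4}{1-2\kappa}.
\]
```

In particular the exponent $p = 4$ is always admissible when $\kappa > 0$, and so is $4+\epsilon$ for $\epsilon$ small, in agreement with the classical discussion at the start of this section.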

11. Background on integral representations of L-functions.

The purpose of this section is as follows. The geometric method we have explained in the text yields upper bounds for certain periods; to obtain subconvexity, we need to know that L-functions can be expressed in terms of these periods. This is the whole point of the theory of integral representations of L-functions; however, we cannot quite simply quote from that theory, as we often need e.g. some analytic control on the choice of test vector for which there is no readily available reference.

On occasion we have only sketched proofs in this section, as they amount to simple explications of standard techniques such as the Rankin-Selberg method, and moreover they are in some sense irrelevant to the main point of this paper (which is to bound periods, not L-functions!)

11.1. Cuspidal triple product L-functions.

Hypothesis 11.1. Let $\pi_2$ and $\pi_3$ be fixed automorphic cuspidal representations of $\mathrm{PGL}_2$ over $F$. Let $\pi_1$ be an automorphic cuspidal representation, whose finite conductor is a prime ideal, prime to the finite conductors of $\pi_2$ and $\pi_3$. Suppose that $\pi_{1,\infty}$ (the representation of $\mathrm{PGL}_2(F_\infty)$ underlying $\pi_1$) is restricted to a bounded set; let $\varphi_1$ be the new vector in $\pi_1$.

Then there exist finite collections of vectors $\mathcal{F}_2 \subset \pi_2$, $\mathcal{F}_3 \subset \pi_3$ so that, for any such $\pi_1$, there exist $\varphi_j \in \mathcal{F}_j$ ($j = 2,3$) with

$\left|\int_X \varphi_2(x\,a([\mathfrak{p}]))\,\varphi_3(x)\,\varphi_1(x)\,dx\right|$

Note that no claim is made about the dependence of the constants in (11.1) on $\pi_2$, $\pi_3$ or the bounded set containing $\pi_{1,\infty}$; presumably with enough effort one could obtain polynomial dependence on the conductors.

The proof of Hypothesis 11.1 should be, we believe, an elaborate but routine computation of certain p-adic integrals; this has not been carried out, but we expect it to be valid.

In the case when $F = \mathbb{Q}$ and $\pi_1, \pi_2, \pi_3$ are holomorphic, Hypothesis 11.1 may follow (in a slightly modified form, replacing $\mathrm{PGL}_2$ by a division algebra) from the work of Böcherer and Schulze-Pillot. In any case, there exist good heuristic reasons to believe the Hypothesis: it is supported by a computation of the size of the relevant family, and it is true if one of the $\pi_j$ is Eisenstein.

11.2. Rankin-Selberg convolutions. 

11.2.1. The Rankin-Selberg integral representation. Let $\pi_1, \pi_2$ be two automorphic representations, with $\pi_2$ cuspidal.

Let $\Psi_v$ be a Schwarz-Bruhat function on $F_v^2$ such that, for almost all $v$, $\Psi_v$ is the characteristic function of $\mathfrak{o}_{F_v}^2$. Put $\Psi = \prod_v \Psi_v$, a Schwarz function on $\mathbb{A}_F^2$. Let $\varphi_j$ belong to the space of $\pi_j$ for $j = 1,2$ and put

(11.2) $I(\varphi_1,\varphi_2,\Psi,s) = \int_X \varphi_1(g)\,\varphi_2(g)\,E_\Psi(s,g)\,dg.$

Unwinding, we see that for $\Re(s) > 1$:

(11.3) $I(\varphi_1,\varphi_2,\Psi,s) = c_F \int_{B(F)\backslash\mathrm{PGL}_2(\mathbb{A}_F)} f_\Psi(s,g)\,\varphi_1(g)\,\varphi_2(g)\,dg = c_F \int_{B(F)\backslash\mathrm{PGL}_2(\mathbb{A}_F)} f_\Psi(s,g) \left( \int_{n \in N(F)\backslash N(\mathbb{A}_F)} \varphi_1(ng)\,\varphi_2(ng)\,dn \right) dg.$

Here the constant $c_F$ arises from change of measure: the measure on $X$ is the $\mathrm{PGL}_2(\mathbb{A}_F)$-invariant probability measure, which is not the same as the quotient measure from $\mathrm{PGL}_2(\mathbb{A}_F)$. Note that $c_F$ will be unimportant in our arguments, as it depends only on $F$ and we are only interested in bounds.

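Put $W_1(g) = \int_{F\backslash\mathbb{A}_F} \varphi_1(n(x)g)\,e_F(x)\,dx$, and define $W_2$ similarly but with $e_F$ replaced by $\overline{e_F}$. Recall that our normalizations are so that the volume of $\mathbb{A}_F/F$ is 1. Fourier inversion shows that $\varphi_1(g) = \sum_{\alpha \in F^\times} W_1(a(\alpha)g)$ if $\varphi_1$ is cuspidal. Thus, as long as one of $\varphi_1, \varphi_2$ is cuspidal, we see that:

(11.4) $I(\varphi_1,\varphi_2,\Psi,s) = c_F \int_{B(F)\backslash\mathrm{PGL}_2(\mathbb{A}_F)} f_\Psi(s,g) \left( \sum_{\alpha \in F^\times} W_1(a(\alpha)g)\,W_2(a(\alpha)g) \right) dg = c_F \int_{N(\mathbb{A}_F)\backslash\mathrm{PGL}_2(\mathbb{A}_F)} W_1(g)\,W_2(g)\,f_\Psi(s,g)\,dg.$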

If $\varphi_1, \varphi_2$ are pure tensors, then there is a corresponding product decomposition $W_1 = \prod_v W_{1,v}$, $W_2 = \prod_v W_{2,v}$, where $W_{j,v}$ belongs to the local Whittaker model of $\pi_{j,v}$, a representation of $\mathrm{PGL}_2(F_v)$. In that case,

(11.5) $I(\varphi_1,\varphi_2,\Psi,s) = c_F \prod_v I_v(W_{1,v},W_{2,v},\Psi_v,s),$

where

(11.6) $I_v(W_{1,v},W_{2,v},\Psi_v,s) = \int_{N(F_v)\backslash\mathrm{PGL}_2(F_v)} W_{1,v}(g_v)\,W_{2,v}(g_v) \left( |\det(g_v)|_v^s \int_{t \in F_v^\times} \Psi_v((0,t)g_v)\,|t|^{2s}\,d^\times t \right) dg_v.$

We note that the bracketed quantity, defined a priori for $g_v \in \mathrm{GL}_2(F_v)$, descends to $\mathrm{PGL}_2(F_v)$.

Lemma 11.1. Suppose $W_{1,v}, W_{2,v}$ are the new vectors associated to spherical representations $\pi_{1,v}, \pi_{2,v}$, $\Psi_v$ is the characteristic function of $\mathfrak{o}_v^2$, and $e_{F_v}$ is unramified. Then $I_v(W_{1,v},W_{2,v},\Psi_v,s) = L_v(s, \pi_{1,v} \times \pi_{2,v})$.

If $W_{1,v}, W_{2,v}$ are nonzero and $\mathrm{PGL}_2(\mathfrak{o}_v)$-invariant, $\Psi_v$ as above, but $e_{F_v}$ is possibly ramified, then $I_v(W_{1,v},W_{2,v},\Psi_v,s) = a\,q_v^{ks}\,L_v(s, \pi_{1,v} \times \pi_{2,v})$ where $k \in \mathbb{Z}$ is so that $e_{F_v}$ is trivial on $\varpi^{-k}\mathfrak{o}_v$ but not on $\varpi^{-k-1}\mathfrak{o}_v$. Moreover, $a = 1$ if $W_{1,v}(\varpi^{-k}) = W_{2,v}(\varpi^{-k}) = 1$.

Suppose $W_{1,v}, W_{2,v}$ are the new vectors associated to $\pi_{1,v}$ spherical and $\pi_{2,v}$ a Steinberg representation, and that $e_{F_v}$ is unramified. Then, with $\Psi_v$ the characteristic function of $\mathfrak{o}_v^2$, we have:

( 1 \ q s 
\ u w v J q v + 1 

Proof. See ^| Thm 15.9] for the first assertion. The second assertion is an easy 
consequence. See Sec. 11.3 for the final assertion. □
Applying the Iwasawa decomposition to (11.6) yields the equivalent

(11.7) $I_v(W_{1,v},W_{2,v},\Psi_v,s) = \int_{y \in F_v^\times,\ k \in K_v} W_1(a(y)k)\,W_2(a(y)k)\,|y|^{s-1}\,d^\times y \left( \int_{t \in F_v^\times} \Psi_v((0,t)k)\,|t|^{2s}\,d^\times t \right) dk.$
11.2.2. Topologizing the space of local representations. The results in [17] provide "good" test vectors for the functionals $I_v$ when the local representations $\pi_{1,v}, \pi_{2,v}$ are fixed. On the other hand, we will need such results with some mild uniformity in $\pi_{1,v}$. One can certainly extract the stronger results from the proofs in [17]. For now, we will proceed by deducing the results "by continuity"; to do this, we will need to define the topology on the space of possible $\pi_{1,v}$. The considerations that follow are not really very crucial; it would be better simply to explicate the implicit dependences in [17].

Let $\mathscr{S}$ be a finite set of irreducible (continuous) representations of $K_v = \mathrm{PGL}_2(\mathfrak{o}_v)$. For any representation $W$ of $K_v$, we denote by $W^{\mathscr{S}}$ that subspace of $W$ consisting of vectors whose $K_v$-span contains only irreducibles that belong to $\mathscr{S}$. We shall say that elements of $W^{\mathscr{S}}$ are of type $\mathscr{S}$.

Let $\mathscr{G}_v$ be the set of isomorphism classes of generic irreducible representations of $\mathrm{PGL}_2(F_v)$. If $\pi$ is a generic irreducible representation which is a discrete series or supercuspidal, we shall define it to be isolated. Otherwise, $\pi$ is induced from two quasicharacters $\mu, \nu : F_v^\times \to \mathbb{C}^\times$. For $s \in \mathbb{C}$, let $(\pi(s),V(s))$ be the representation induced from the quasicharacters $\mu|\cdot|^s$, $\nu|\cdot|^{-s}$. Then $\pi(s)$ is generic for all $s \in \mathbb{C}$ and irreducible in a neighbourhood of $0$. We shall topologize $\mathscr{G}_v$ in such a way that sets of the form $\{[\pi(s)] : |s| < \epsilon\}$ form a basis.

If $E \subset \mathscr{G}_v$ is a closed subset that is bounded (when considered as a subset of the set of isomorphism classes of irreducible admissible representations, and bounded in the sense of Sec. 2.12.3), then $E$ is compact, as one checks by direct verification.

For each $\pi \in \mathscr{G}_v$ we have a Whittaker model $\mathcal{W}(\pi)$. Consider a function $\pi \mapsto W_\pi$, that assigns to each $\pi \in \mathscr{G}_v$ an element $W_\pi$ of its Whittaker model. We shall say that such an assignment $\pi \mapsto W_\pi$ is continuous if there exists a neighbourhood of each $\pi$, which we may assume to be of the form $\{\pi(s) : |s| < \epsilon\}$, and a set $\mathscr{S}$ of irreducible representations of $K_v$ so that:

(1) $W_{\pi(s)}$ is of type $\mathscr{S}$, for each $|s| < \epsilon$

(2) The assignment $s \mapsto W_{\pi(s)}(g)$ is continuous for each $g \in \mathrm{PGL}_2(F_v)$, uniformly for $g$ in any fixed compact.

It can be verified that if $W_0$ is an element of the Whittaker model of $\pi_0$, there exists a continuous assignment $\pi \mapsto W_\pi$ in a neighbourhood of $\pi_0$ which has the value $W_0$ at $\pi_0$.

The requirement (2) is not very strong, as it does not impose any uniformity on all of $\mathrm{PGL}_2(F_v)$. However, in every context we shall consider, the necessary uniformity in $g$ is automatic. Let us sketch how one can prove such results. Assume that $v$ is finite; the infinite case is similar although more technically involved. One first observes that if $\pi \mapsto W_\pi$ is a continuous assignment on some open set, then, for a fixed character $\chi_v$ of $F_v^\times$, the quotient

$\frac{\int_{F_v^\times} W_\pi(a(y))\,\chi_v(y)\,|y|^{s-1/2}\,d^\times y}{L_v(s, \pi \otimes \chi_v)}$

is a polynomial of the form $\sum_{k=-N}^{N} c_k\,q_v^{ks}$; moreover, the degree $N$ is locally bounded as $\pi$ varies, and all the coefficients $c_k$ can be taken to depend continuously on $\pi$. To verify the local boundedness of the degree - which requires only property (1) above - one just notes that there is (locally) a fixed $M$ such that $W_\pi(a(y))$ vanishes for $|y|_v > M$; this, together with the functional equation, gives the local boundedness. To see that the coefficients vary continuously, it suffices to check that, for any fixed integer $t$, the integral $\int_{v(y) = t} W_\pi(a(y))\,\chi_v(y)\,|y|^{s-1/2}\,d^\times y$ varies continuously, which follows from the definition of continuity for the assignment $\pi \mapsto W_\pi$. The archimedean case proceeds similarly, but one replaces the role of polynomials in $q_v^{s}$ by functions of the form $c^s P(s)$, where $P$ is a polynomial and $c \in \mathbb{R}$.

In the next few pages, we will make certain claims regarding the continuity of various integrals involving $W_\pi$, if $\pi \mapsto W_\pi$ is a continuous assignment. One can reduce all the claimed continuity statements (by standard "Mellin transform" arguments) to the result just discussed. We will omit the details.

11.2.3. Choice of test vectors. 

Lemma 11.2. Let notation be as above.

(1) The quotient

$\Xi_v(W_{1,v},W_{2,v},\Psi_v,s) := \frac{I_v(W_{1,v},W_{2,v},\Psi_v,s)}{L_v(s, \pi_{1,v} \otimes \pi_{2,v})}$

is holomorphic in $s$. If $v$ is nonarchimedean, $\Xi_v$ is a polynomial in $q_v^{\pm s}$; if $v$ is archimedean and $\Psi_v$ is standard, then $\Xi_v$ is a polynomial in $s$ up to a factor of the form $b^s$.

(2) For any fixed $s_0 \in \mathbb{C}$ we may choose data $(W_{1,v},W_{2,v},\Psi_v)$ of the type described in (1) with $\Xi_v(W_{1,v},W_{2,v},\Psi_v,s_0) \ne 0$.

(3) If $\pi_{2,v}$ is regarded as fixed, and $\pi_{1,v}$ remains within a fixed compact subset of $\mathscr{G}_v$ consisting entirely of unitarizable representations, then there exists a constant $C$ depending on the compact set so that one may choose data as in (2) in such a way that:

(a) $\Psi_v$ and $W_{2,v}$ may both be chosen from a finite list of size $\le C$;

(b) $\int_{F_v^\times} |W_{1,v}(a(y))|^2\,d^\times y \le C$;

(c) $|\Xi_v(s_0)| \ge 1$ and, for all $s \in \mathbb{C}$, we have $|\Xi_v(s)| \le C^{|\Re(s)|}(1+|s|)^C$.

Proof. The first two assertions are in [17].

We will only sketch the last assertion. It can also be proved directly by exhibiting such data by explicating the arguments of [17]. In any case, we start by noting: Given any continuous assignment $\pi_{1,v} \mapsto W_{1,v}$, $\pi_{2,v} \mapsto W_{2,v}$, the function $\Xi_v(W_{1,v},W_{2,v},\Psi_v,s)$ and $\int |W_{1,v}(a(y))|^2\,d^\times y$ all vary continuously. This assertion can be deduced by the methods explained in Section 11.2.2. Here, when we speak of $\Xi_v(W_{1,v},W_{2,v},\Psi_v,s)$ varying continuously, we mean this in the "strong sense", i.e. the statement that $\Xi_v$ can be expressed as a polynomial in $q_v^{\pm s}$ (nonarchimedean case) or $b^s P(s)$ where $P$ is polynomial (archimedean case), so that all coefficients vary continuously with $\pi_{1,v}, \pi_{2,v}$.

Now, fix momentarily $\pi_{1,v}$ and $\pi_{2,v}$ and suppose we have chosen data $(W_{1,v},W_{2,v},\Psi_v)$ as in (2). Extend $W_{1,v}$ to a continuous assignment $\pi \mapsto W_\pi$ in a neighbourhood of $\pi_{1,v}$. By the remarks above, $(W_\pi,W_{2,v},\Psi_v)$ will satisfy (3b) and (3c), for a suitable constant $C$, whenever $\pi$ belongs to a sufficiently small neighbourhood of $\pi_{1,v}$. Now a compactness argument demonstrates (3). □

We emphasize again that (11.6) is valid so long as one of $\pi_1, \pi_2$ is cuspidal.

Lemma 11.3. Let $\pi$ be an automorphic cuspidal representation in $L^2(X)$, and let $\varphi \in \pi$ be so that $W_\varphi := \int_{F\backslash\mathbb{A}_F} e_F(x)\,\varphi(n(x)g)\,dx$ factorizes as a product $\prod_v W_v(g)$. Then, for a certain absolute constant $c$,

(11.8) $\int_X |\varphi(g)|^2\,dg = c\,\mathrm{Res}_{s=1}\Lambda(s, \pi \otimes \pi) \prod_v \frac{\int_{F_v^\times} |W_v(a(y))|^2\,d^\times y}{L_v(1, \pi_v \otimes \pi_v)}.$




Proof. This follows by taking the residue of $I(\varphi,\overline{\varphi},\Psi,s)$ at $s = 1$. Indeed, this residue equals, up to a constant depending only on the measure normalization,

$\left(\int_X |\varphi(g)|^2\,dg\right)\left(\int_{\mathbb{A}_F^2} \Psi(x,y)\,dx\,dy\right)$ (see discussion of properties of $E_\Psi$ after (10.3)).

On the other hand, by (11.5) and (11.6), $I(\varphi,\overline{\varphi},\Psi,s)$ may be written as a product $c_F \prod_v I_v(W_v,\overline{W_v},\Psi_v,s)$, where each $I_v$ is given by (11.7). The integral $\int_{F_v^\times} |W_v(a(y)k)|^2\,d^\times y$ is independent of $k \in K_v$, so $I_v(W_v,\overline{W_v},\Psi_v,1)$ factors as the product of $\int_{y \in F_v^\times} |W_v(a(y))|^2\,d^\times y$ and $\int_{t \in F_v^\times,\ k \in K_v} \Psi_v((0,t)k)\,|t|^2\,d^\times t\,dk$. The latter integral differs from $\int_{F_v^2} \Psi_v(x,y)\,dx\,dy$ by a factor that depends only on the normalizations of measure; moreover, this factor equals $(1 - q_v^{-2})^{-1}$ for almost all $v$, so the product of these factors is convergent. The conclusion easily follows. □

We now specialize to the cases of interest. Fix $\pi_1$. We vary $\pi_2 := \pi$ through a sequence of automorphic cuspidal representations with prime conductor $\mathfrak{p}$, prime to the conductor of $\pi_1$. In particular, the local constituent of $\pi$ at $\mathfrak{p}$ is a special representation. We denote by $\pi_\infty$ the representation of $\mathrm{GL}_2(F_\infty)$ underlying the representation $\pi$.

Lemma 11.4. Suppose the archimedean constituent $\pi_\infty$ belongs to a bounded subset of $\widehat{\mathrm{PGL}_2(F_\infty)}$ (in what follows the implicit constants may depend on this subset) and regard $\pi_1$ as being fixed.

Let $s_0 \in \mathbb{C}$. There exists a fixed finite set $\mathcal{F}$ of Schwarz-Bruhat functions and a real number 22 $C > 0$ so that, for any such $\pi$,

there exist vectors $\varphi_1 \in \pi_1$, $\varphi \in \pi$ and $\Psi \in \mathcal{F}$ so that

A(S,7T1 <g)7T) 

is holomorphic and satisfies: 

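(1) $|\Phi(s_0)| \gg 1$ and $|\Phi(s)| \ll C^{|\Re(s)|}(1+|s|)^C$;

(2) At any nonarchimedean place $v$ such that $\pi_1$ and $\pi$ are both unramified, both $\varphi$ and $\varphi_1$ are invariant by $\mathrm{PGL}_2(\mathfrak{o}_{F_v})$;

(3) $\|\varphi_1\|_{L^\infty}$ is $O(1)$.

(4) $\|\varphi\|_{L^2(X)} \ll_\epsilon N(\mathfrak{p})^\epsilon$.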

Proof. We first choose local data. For each place where $e_{F,v}$ and $\pi_1$ are not ramified, we take $W_v$ (resp. $W_{v,1}$) to be the new vector in the Whittaker model of $\pi_v$ (resp. $\pi_{1,v}$). We put $\Psi_v$ to be the characteristic function of $\mathfrak{o}_v^2$.

Let $B$ be the set of remaining $v$. For $v \in B$, the assumptions show that $\pi_v$ is restricted to a bounded set. We choose $W_v, W_{v,1}, \Psi_v$ for $v \in B$ according to Lem. 11.2. Finally we choose $\varphi$ so that $\int_{x \in F\backslash\mathbb{A}_F} e_F(x)\,\varphi(n(x)g)\,dx = \prod_v W_v(g)$, and similarly for $\varphi_1$, and take $\Psi = \prod_v \Psi_v$. The first two assertions of the Lemma are immediate.

To bound the $L^2$ norm of $\varphi$, use Lem. 11.2 (3b), Lem. 11.3 and Iwaniec's bounds on L-functions near 1. As for $\varphi_1$, it in fact belongs to a fixed finite set of cusp forms, so the third assertion is immediate. □

We continue to keep $\pi$ an automorphic cuspidal representation of $\mathrm{PGL}_2(\mathbb{A}_F)$ with prime conductor.



22 depending on $\pi_1$ and the choice of bounded subset of $\widehat{\mathrm{PGL}_2(F_\infty)}$
Lemma 11.5. Suppose $\pi_\infty$ belongs to a bounded subset of $\widehat{\mathrm{PGL}_2(F_\infty)}$ (in what follows the implicit constants may depend on this bounded subset).

Let $t_0, t_0' \in \mathbb{C}$. There exists a fixed finite set $\mathcal{F}$ of Schwarz-Bruhat functions and a real number $C > 0$ so that:

There exist vectors $\varphi \in \pi$, $\Psi_1, \Psi_2 \in \mathcal{F}$ so that:

(11 9) $(f t') = ^( )i/2-t Sx.^9)E^ 1 (g, \ + t)E* 2 {ga({p]), \+t')dg 

is holomorphic and satisfies:

(1) $|\Phi(t_0,t_0')| \gg 1$ and $|\Phi(t,t')| \ll (1+|t|+|t'|)^C\, C^{|\Re(t)|+|\Re(t')|}$.

(2) For any nonarchimedean place $v$, each of $\Psi_1$ and $\Psi_2$ is invariant by $\mathrm{PGL}_2(\mathfrak{o}_{F_v})$.

(3) $\|\varphi\|_{L^2(X)} \ll_\epsilon N(\mathfrak{p})^\epsilon$.

Proof. The proof is similar to that of the previous Lemma; recall that (11.6) was
valid as long as one of $\pi_1, \pi_2$ was cuspidal.

Let be the translate of the Schwarz function Hf 2 by a([p]). Then by I1U.4|) . 

E n (s,g) = N(p)- s El { l p]) (s,g) 

Suppose $\Psi_1, \Psi_2$ factorize as $\prod_v \Psi_{1,v}$, $\prod_v \Psi_{2,v}$, and define $W_{\Psi_1,v}(s, g)$ and $W_{\Psi_2,v}(s, g)$
as in Lem. 10.5. Suppose moreover that $\int_{x \in F\backslash\mathbb{A}_F} e_F(x)\varphi(n(x)g)$ factorizes as
$\prod_v W_v(g)$. Then we can express the global integral of (11.9) as a product in two
different ways, depending on whether we let $E_{\Psi_1}$ or $E_{\Psi_2}$ play the role of $\pi_2$. Namely,
as in (11.7):

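(11.10)
$$\int \varphi(g)\, E_{\Psi_1}(g, \tfrac12 + t)\, E_{\Psi_2}(g\,a([\mathfrak{p}]), \tfrac12 + t')\,dg$$
$$= c_F\, N(\mathfrak{p})^{1/2+t'} \prod_v I_v\big(W_v,\ W_{\Psi_1,v}(1/2 + t, \cdot),\ \Psi_{2,v},\ 1/2 + t'\big)$$
$$= c_F \prod_v I_v\big(W_v,\ W_{\Psi_2,v}(1/2 + t', \cdot)^{a([\mathfrak{p}])_v},\ \Psi_{1,v},\ 1/2 + t\big).$$

Here $W_{\Psi_2,v}(1/2 + t', \cdot)^{a([\mathfrak{p}])_v}$ denotes the translate of $W_{\Psi_2,v}$ by the $v$th component
of $a([\mathfrak{p}])$.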

For $v$ nonarchimedean (notations being similar to that of the previous Lemma)
we take $\Psi_{1,v}$ and $\Psi_{2,v}$ to be the characteristic function of $\mathfrak{o}_{F_v}^2$ for every finite $v$, and
$W_v$ to be the new vector.

For $v$ archimedean we first apply Lem. 11.2 with $s_0 = 1/2 + t_0$, and $\pi_{2,v}$ the
representation of $\mathrm{PGL}_2(F_v)$ spanned by $E_{\Psi_2}(1/2 + t_0', g)$, i.e. the representation
unitarily induced from the character $a(y) \mapsto |y|^{t_0'}$. Lem. 11.2 provides $W_v$ in
the Whittaker model of $\pi_v$, $W_{2,v}$ in the Whittaker model of $\pi_{2,v}$, and a Schwartz
function $\Psi_{1,v}$ with $|I_v(W_v, W_{2,v}, \Psi_{1,v}, 1/2 + t_0)| \gg 1$. The last comment of Rem.
10.2 shows that there is a standard $\Psi_{2,v}$ so that $W_{\Psi_2,v}(1/2 + t_0', g_v) = W_{2,v}(g_v)$
(notation of Lem. 10.5). Moreover, Lemma 11.2 also shows that $\Psi_{1,v}$ and $W_{2,v}$
(so also $\Psi_{2,v}$) may be chosen from a fixed finite set of possibilities (depending, of
course, on the original bounded set to which $\pi_\infty$ belongs, as well as $t_0$ and $t_0'$).

Again we put $\Psi_i = \prod_v \Psi_{i,v}$ for $i = 1, 2$ and take $\varphi$ with $\int_{x \in F\backslash\mathbb{A}_F} e_F(x)\varphi(n(x)g) =
\prod_v W_v(g)$. From (11.10) we deduce that, with our choices, $|\Phi(t_0, t_0')| \gg 1$. The
assertion about $\|\varphi\|_{L^2}$ follows as in the proof of the previous Lemma. The second
assertion of the Lemma (concerning invariance of $\Psi_i$) is immediate.

It remains to prove that $ is actually holomorphic in (t, t') and that t')\ <C 
(1 + |t| + WfeCimi+CMtX Put Ey = L ^ +t+t ,J L ^ +t _ t ,^ . It is simple 
to explicitly compute $\Xi_v$ for nonarchimedean $v$, using Cor. 10.1 and Lem. 11.1.
One thereby sees that it will suffice to check, by similar arguments to those used in
Lem. 11.2, the following statement for $v$ archimedean: $\Xi_v = c^{s} c'^{s'} P(s, s')$, where
$P$ is a polynomial, and moreover $c, c', P$ vary continuously in $\pi_v$, if $\pi_v \mapsto W_v$ is a
continuous assignment. We only sketch the proof of this. From (11.7) and the fact
that $W_v, \Psi_{1,v}, \Psi_{2,v}$ are all $K$-finite, it suffices to prove the corresponding assertions
for $\int_{F_v^\times} W_v(a(y))\, W_{\Psi_1,v}(s, a(y))\, |y|^{s-1}\, d^\times y$. For this we use Barnes' formula as in

na - ■ □ 



11.3. Local Rankin-Selberg convolutions. Let $v$ be a nonarchimedean place
of $F$ with residue characteristic $q_v$. Let $\pi_1, \pi_2$ be generic irreducible admissible
representations of $\mathrm{GL}(2, F_v)$ with trivial central character. (Since we shall work
purely locally over $F_v$ throughout the present subsection, we shall use the notation
$\pi_1$ rather than e.g. $\pi_{1,v}$).

Then $\pi_1, \pi_2$ are self-dual. We assume that $\pi_2$ is unramified and $\pi_1$ has conductor
$q_v$, and denote by $L(s, \pi_j)$ the local L-factors.

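Fix once and for all an additive unramified character $\psi$ of $F_v$. Let $v : F_v^\times \to \mathbb{Z}$ be
the valuation, put $\mathfrak{o}_{F_v} = \{x \in F_v : v(x) \geq 0\}$, and choose a uniformizer $\varpi \in F_v^\times$.
Let $\mathfrak{o}_{F_v}^\times$ be the multiplicative group of units in $\mathfrak{o}_{F_v}$. Let $d^\times x$, $dx$ be Haar measures
on $F_v^\times$, $F_v$ respectively, assigning mass 1 to $\mathfrak{o}_{F_v}^\times$ and $\mathfrak{o}_{F_v}$ respectively. For $x \in F_v$,

put $n(x) = \begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix}$, $w = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$. We choose a Whittaker model

for $\pi_1$ transforming by the character $n(x) \mapsto \psi(x)$, and a Whittaker model for $\pi_2$
transforming by the character $n(x) \mapsto \overline{\psi(x)}$.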

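Let $\Psi_v$ be the characteristic function of $\mathfrak{o}_{F_v}^2$. Set $W_1$ to be the new vector in the
Kirillov model of $\pi_1$, let $W_2^*$ be the new vector in the Kirillov model of $\pi_2$, and set

$W_2 = \pi_2\begin{pmatrix} \varpi^{-1} & 0 \\ 0 & 1 \end{pmatrix} W_2^*$. Then both $W_1, W_2$ are invariant by the subgroup

(11.11) $K_0 = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} : a, b, d \in \mathfrak{o}_{F_v},\ c \in \varpi\mathfrak{o}_{F_v} \right\}$.

Moreover $W_2$ is invariant by $n(\varpi^{-1}\mathfrak{o}_{F_v})$.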

Lemma 11.6. Notations being as in (11.6), let $L(s, \pi_1 \times \pi_2)$ be the local L-factor.
Then:

$$\frac{I_v(W_{1,v}, W_{2,v}, \Psi_v)}{L(s, \pi_1 \times \pi_2)} = \pm\frac{q_v^{s}}{q_v + 1}.$$

Proof. We shall often use the following shorthand: if $W$ is a function in the Whittaker
model of $\pi \in \{\pi_1, \pi_2\}$ and $z \in F_v^\times$, we write $W(z)$ for $W\begin{pmatrix} z & 0 \\ 0 & 1 \end{pmatrix}$.

Thus, for instance, $W_2(z) = W_2^*(z\varpi^{-1})$. The function $z \mapsto W(z)$ belongs to the
Kirillov model of $\pi$.
Let $\epsilon \in \{-1, 1\}$ be the local root number of $\pi_1$ (it lies in $\{-1, 1\}$ since $\pi_1$ is
self-dual). Then:

(11.12)
$$\int_{a \in F_v^\times} W_1(a)|a|^{s-1/2}\,d^\times a = L(s, \pi_1), \qquad \int_{a \in F_v^\times} W_2(a)|a|^{s-1/2}\,d^\times a = q_v^{-(s-1/2)} L(s, \pi_2),$$

as follows from defining properties of newforms and the fact $W_2(z) = W_2^*(z\varpi^{-1})$;
moreover

(11.13)
$$\int_{a \in F_v^\times} \pi_1(w)W_1(a)|a|^{s-1/2}\,d^\times a = \epsilon\, q_v^{(s-1/2)} L(s, \pi_1), \qquad \int_{a \in F_v^\times} \pi_2(w)W_2(a)|a|^{s-1/2}\,d^\times a = q_v^{(s-1/2)} L(s, \pi_2),$$

as follows from the local functional equation for the standard L-function on GL(2): see
[16, 2.18].$^{23}$

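Note moreover that $W_1$, $W_2$, $\pi_1(w)W_1$, and $\pi_2(w)W_2$ are all invariant by the
maximal compact subgroup of the diagonal torus of $\mathrm{GL}_2$. Thus (11.12) and (11.13)
completely determine their restriction to the diagonal torus; we now explicate this.

Choose $\alpha \in \mathbb{C}$ so that $L(s, \pi_1) = (1 - \alpha q_v^{-s})^{-1}$. In fact, $\alpha = -\epsilon q_v^{-1/2}$, by [16,
Prop. 3.6]. Choose $\gamma_1, \gamma_2$ so that $L(s, \pi_2) = ((1 - \gamma_1 q_v^{-s})(1 - \gamma_2 q_v^{-s}))^{-1}$. Recalling
the notational convention established in the paragraph prior to (11.12), we see:

(11.14)
$$W_1(\varpi^r) = \begin{cases} \alpha^r q_v^{-r/2}, & r \geq 0 \\ 0, & r < 0 \end{cases} \qquad \pi_1(w)W_1(\varpi^r) = \begin{cases} \epsilon\,\alpha^{r+1} q_v^{-(r+1)/2}, & r \geq -1 \\ 0, & r < -1 \end{cases}$$

$$W_2(\varpi^r) = \begin{cases} 0, & r < 1 \\ 1, & r = 1 \\ (\gamma_1^{r-1} + \gamma_1^{r-2}\gamma_2 + \cdots + \gamma_2^{r-1})\, q_v^{-(r-1)/2}, & r \geq 2 \end{cases}$$

$$\pi_2(w)W_2(\varpi^r) = \begin{cases} 0, & r < -1 \\ 1, & r = -1 \\ (\gamma_1^{r+1} + \gamma_1^{r}\gamma_2 + \cdots + \gamma_2^{r+1})\, q_v^{-(r+1)/2}, & r \geq 0 \end{cases}$$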

The local integral we wish to evaluate is the right hand side of (11.6). In the case
at hand, with $N, G, Z$ denoting the $F_v$-points of the respective groups, we have:

(11.15)
$$I(s) := \int_{ZN\backslash G} W_1(g)W_2(g)|\det(g)|^{s}\left(\int_{t \in F_v^\times} \Psi_v((0, t)\cdot g)|t|^{2s}\,d^\times t\right) dg.$$

Using the Iwasawa decomposition, and recalling $\Psi_v$ was the characteristic function
of $\mathfrak{o}_{F_v}^2$, one finds:

$$I(s) = (1 - q_v^{-2s})^{-1}\int_{A \times K_v} \pi_1(k)W_1(a)\,\pi_2(k)W_2(a)\,|a|^{s-1}\,d^\times a\,dk,$$



23 That is, / agFX W[p)\a\-V*d*a = .(.^ffi,,^ J FX W(a W )[a| ^-'^(a)^ a. 



In 



particular, if 7r is a representation with trivial central character, and \ a character of F x , 
where the measure $dk$ is the Haar measure of total mass 1, and $d^\times a$ assigns mass
1 to $A \cap K_v$.

The function $k \mapsto \pi_1(k)W_1(a)\,\pi_2(k)W_2(a)$ is right invariant by $K_0$ (see (11.11)
for definition) and left invariant by $N \cap K_v$. There are two $(N \cap K_v, K_0)$ double
cosets in $K_v$, and we may therefore express $I(s)$ as a sum:

(11.16)
$$(1 - q_v^{-2s})I(s) = \frac{1}{q_v+1}\int_{a\in F_v^\times} W_1(a)W_2(a)|a|^{s-1}\,d^\times a + \frac{q_v}{q_v+1}\int_{a \in F_v^\times}\pi_1(w)W_1(a)\,\pi_2(w)W_2(a)|a|^{s-1}\,d^\times a.$$

To evaluate $I(s)$, we use (11.14). Noting that

$$L(s, \pi_1\times\pi_2) = \frac{1}{(1-\alpha\gamma_1 q_v^{-s})(1-\alpha\gamma_2 q_v^{-s})},$$

an easy computation shows

$$I(s) = \frac{\epsilon\,(q_v^{s} - q_v^{-s})}{(q_v+1)(1 - q_v^{-2s})}\,L(s, \pi_1\times\pi_2),$$

from where we obtain $I(s) = \epsilon\,\frac{q_v^{s}}{q_v+1}\,L(s, \pi_1 \times \pi_2)$. Note also that $I(s)$ satisfies the
necessary functional equation. □
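The "easy computation" at the end of this proof can be sanity-checked numerically. The following is only a sketch under stated assumptions, not the author's method: it assumes the torus values of (11.14) read $W_1(\varpi^r) = \alpha^r q_v^{-r/2}$, $\pi_1(w)W_1(\varpi^r) = \epsilon\alpha^{r+1}q_v^{-(r+1)/2}$, $W_2(\varpi^r) = h_{r-1}(\gamma_1,\gamma_2)q_v^{-(r-1)/2}$, $\pi_2(w)W_2(\varpi^r) = h_{r+1}(\gamma_1,\gamma_2)q_v^{-(r+1)/2}$ (with $h_m$ the complete homogeneous symmetric polynomial), that the two double cosets in (11.16) carry weights $1/(q_v+1)$ and $q_v/(q_v+1)$, and that $\alpha = -\epsilon q_v^{-1/2}$. All function names are hypothetical.

```python
import cmath


def h(m, g1, g2):
    """Complete homogeneous symmetric polynomial h_m(g1, g2) = sum_{i+j=m} g1^i g2^j."""
    return sum(g1**i * g2**(m - i) for i in range(m + 1))


def rankin_selberg_factor(q, s, eps, g1, g2):
    """Assumed local factor 1 / ((1 - a*g1*q^-s)(1 - a*g2*q^-s)) with a = -eps*q^(-1/2)."""
    alpha = -eps * q**-0.5
    return 1 / ((1 - alpha * g1 * q**-s) * (1 - alpha * g2 * q**-s))


def local_integral(q, s, eps, g1, g2, terms=120):
    """Truncated evaluation of I(s) from the assumed torus values and coset weights."""
    alpha = -eps * q**-0.5

    def W1(r):
        return alpha**r * q**(-r / 2) if r >= 0 else 0

    def wW1(r):
        return eps * alpha**(r + 1) * q**(-(r + 1) / 2) if r >= -1 else 0

    def W2(r):
        return h(r - 1, g1, g2) * q**(-(r - 1) / 2) if r >= 1 else 0

    def wW2(r):
        return h(r + 1, g1, g2) * q**(-(r + 1) / 2) if r >= -1 else 0

    # |varpi^r|^(s-1) = q^(-r(s-1)); the unit group has measure 1.
    rs = range(-1, terms)
    A = sum(W1(r) * W2(r) * q**(-r * (s - 1)) for r in rs)
    B = sum(wW1(r) * wW2(r) * q**(-r * (s - 1)) for r in rs)
    return (A + q * B) / ((q + 1) * (1 - q**(-2 * s)))


def relative_error(q, s, eps, theta):
    """Relative gap between the truncated I(s) and eps * q^s / (q+1) * L(s, pi1 x pi2)."""
    g1 = cmath.exp(1j * theta)   # a sample Satake parameter on the unit circle
    g2 = 1 / g1                  # trivial central character: g1 * g2 = 1
    lhs = local_integral(q, s, eps, g1, g2)
    rhs = eps * q**s / (q + 1) * rankin_selberg_factor(q, s, eps, g1, g2)
    return abs(lhs - rhs) / abs(rhs)
```

For instance, `relative_error(5, 0.7 + 0.3j, -1, 0.4)` should be at the level of floating-point rounding if the assumed normalizations are mutually consistent.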



11.4. Hecke-Jacquet-Langlands integral representations for standard L-functions.
Our goal here is to prove Prop. 6.1 and 6.2 used in the text. This
amounts to explicit computations connected to Hecke-Jacquet-Langlands integral
representations. Since, in the main text, we obtain subconvexity for GL(1) twists
of GL(2) L-functions, with polynomial dependence in all parameters, we will have
to be somewhat more precise than in the case of Rankin-Selberg L-functions.

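Let $\pi$ be a cuspidal representation of $\mathrm{GL}_2$ over $\mathbb{A}_F$. Let $\chi$ be a unitary character
of $\mathbb{A}_F^\times/F^\times$ of finite conductor $\mathfrak{f}$. Put $L^{unr}(s, \pi \times \chi)$ to be the unramified part of
the (finite) standard L-function:

$$L^{unr}(s, \pi \times \chi) := \prod_{v \text{ finite},\ \chi_v \text{ unramified}} L_v(s, \pi_v \times \chi_v).$$

Define $\mu_z$ as in (6.4), i.e. the measure on $X_{\mathrm{GL}(2)}$ defined as

$$\mu_z(f) = \int_{|y| = z} f(a(y)n([\mathfrak{f}]))\chi(y)\,d^\times y.$$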

We refer to Sec. 12.31 and Sec. 12.51 for notation, as well as the start of Section H3 for 
a discussion of the meaning of fi z in classical terms. 

Lemma 11.7. Let $v$ be a nonarchimedean place of $F$ with residue characteristic $q_v$,
and $\pi_v$ an irreducible generic representation of $\mathrm{GL}_2(F_v)$. Let $\psi_v$ be an unramified
additive character of $F_v$. Let $\chi_v : F_v^\times \to \mathbb{C}^\times$ be a multiplicative character of conductor
$r$, and $W_v$ be the new vector in the $\psi_v$-Whittaker model of $\pi_v$. Then

$$\int_{y\in F_v^\times} W_v(a(y)n(\varpi^{-r}))\chi_v(y)|y|^{s-1/2}\,d^\times y = \begin{cases} L_v(s, \pi_v\times\chi_v), & r = 0,\\ \theta, & r \geq 1,\end{cases}$$

where $\theta$ is a scalar of absolute value $q_v^{-r/2}(1 - q_v^{-1})^{-1}$.





Proof. If $r = 0$, then $\chi_v$ is unramified, and the result follows immediately from the
definition of the new vector. Otherwise, $\chi_v$ is ramified, and we rewrite the integral
under consideration as

(11.17)
$$\int_{y\in F_v^\times} W_v(a(y))\,\psi_v(\varpi_v^{-r}y)\,\chi_v(y)|y|^{s-1/2}\,d^\times y.$$

Now $W_v(a(y))$ vanishes when $v(y) < 0$ and it is $\mathfrak{o}_{F_v}^\times$-invariant. The integral
$\int_{v(y) = k} \chi_v(y)\psi_v(\varpi_v^{-r}y)\,d^\times y$ is nonvanishing only when $k = 0$. In that case, it is
a Gauss sum with absolute value $\frac{q_v^{-r/2}}{1 - q_v^{-1}}$, where the factor $(1 - q_v^{-1})^{-1}$ arises from
the measure normalization (cf. Sec. 2.6), namely $\int_{v(y)=0} d^\times y = 1$. The result
follows. □
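The Gauss-sum absolute value used in this proof can be illustrated in the simplest case. The sketch below assumes $F_v = \mathbb{Q}_p$ with $p$ an odd prime, conductor exponent $r = 1$, $\psi_v(\varpi^{-1}y) = e^{2\pi i y/p}$ on units, and a nontrivial character $\chi$ of $(\mathbb{Z}/p)^\times$ built from a primitive root; the normalization $\int_{v(y)=0} d^\times y = 1$ becomes an average over $(\mathbb{Z}/p)^\times$. The helper names are hypothetical.

```python
import cmath


def primitive_root(p):
    """Smallest primitive root modulo an odd prime p (brute force)."""
    for g in range(2, p):
        if len({pow(g, k, p) for k in range(1, p)}) == p - 1:
            return g
    raise ValueError("no primitive root found; p must be an odd prime")


def normalized_gauss_sum(p, j):
    """Average over units y of chi_j(y) * e(y/p), where chi_j(g^k) = e(j*k/(p-1)).

    chi_j is nontrivial (conductor exponent 1) when j is not divisible by p-1.
    """
    g = primitive_root(p)
    total = 0j
    for k in range(p - 1):
        y = pow(g, k, p)
        chi = cmath.exp(2j * cmath.pi * j * k / (p - 1))
        psi = cmath.exp(2j * cmath.pi * y / p)
        total += chi * psi
    return total / (p - 1)   # unit group has total mass 1


def expected_abs(p, r=1):
    """The value q^(-r/2) / (1 - q^(-1)) with q = p."""
    return p ** (-r / 2) / (1 - 1 / p)
```

For every nontrivial `j`, `abs(normalized_gauss_sum(p, j))` should agree with `expected_abs(p)` up to rounding, which is the classical identity $|\sum_y \chi(y)e(y/p)| = \sqrt{p}$ rescaled by the unit-group normalization.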

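Lemma 11.8. Let $d, \beta > 0$. Then there exists $\varphi \in \pi$ such that, with

(11.18)
$$\Phi(s) := N(\mathfrak{f})^{1/2}\,\frac{\int_z \mu_z(\varphi)|z|^{s-1/2}\,d^\times z}{L^{unr}(s, \pi\times\chi)},$$

then $\Phi(s)$ is holomorphic and satisfies:

(1) $|\Phi(s)| \ll_{\Re(s),\epsilon} N(\mathfrak{f})^{\epsilon}$ and $|\Phi(1/2)| \gg_\epsilon N(\mathfrak{f})^{-\epsilon}$.

(2) $\varphi$ is new at every finite place (i.e., for each finite prime $\mathfrak{q}$ it is invariant by
$K_0[\mathfrak{q}^{s_\mathfrak{q}}]$, where $s_\mathfrak{q}$ is the local conductor of the local constituent $\pi_\mathfrak{q}$).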

(3) The Sobolev norms of ip satisfy the bounds (conductor notation as in Sec. 

mm 

(11.19) $S_{\infty,d,\beta}(\varphi) \ll_\epsilon \mathrm{Cond}_\infty(\pi)^{2d+\epsilon}\,\mathrm{Cond}_f(\pi)^{\beta+\epsilon}\,\mathrm{Cond}_\infty(\chi)^{1/2+2d}$

Proof. For each infinite place $w$ of $F$, denote by $\mathrm{Cond}_w(\chi)$ the contribution from
$w$ to the Iwaniec-Sarnak analytic conductor of $\chi$ (see Sec. 2.12.2).

The map $\varphi \mapsto W_\varphi = \int_{F\backslash\mathbb{A}_F} e_F(x)\varphi(n(x)g)$ is an isomorphism between the space
of $\pi$ and the Whittaker model of $\pi$. For each finite $v$, take $W_v$ to be a new vector
in the Whittaker model of $\pi_v$. A point of caution is that $e_F$ may not be unramified
on $F_v$; to be absolutely concrete, we set $W_v(g) = W_{v,new}(a(\varpi^{d_v})g)$, where $W_{v,new}$
is the new vector in the Whittaker model of $\pi_v$ taken w.r.t. an unramified additive
character of $F_v$, and $d_v = v(\mathfrak{d})$ is the local valuation of the different.

Let us now choose $W_v$ at the infinite places. Let $g_1$ be a smooth positive function
of compact support on $F_v$. Let $\deg(v) = 2$ if $v$ is complex and $\deg(v) = 1$ if $v$ is
real. For $v | \infty$, define

$$W_v(y) = \mathrm{Cond}_v(\chi)\,g_1\big(\mathrm{Cond}_v(\chi)^{1/\deg(v)}(y - 1)\big).$$

This is possible by the theory of the Kirillov model; thus $W_v$ is a smooth (but
not $K_v$-finite) vector. In words, if $v$ is real, the function $W_v$ is supported in a
neighbourhood of the identity of size $\mathrm{Cond}_v(\chi)^{-1}$ and takes values of size $|\mathrm{Cond}_v(\chi)|$
there; if $v$ is complex, a similar statement holds but now $W_v$ is supported in a disc
around the identity with area $\mathrm{Cond}_v(\chi)^{-1}$.
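The point of this localization is that $\chi_v$ does not oscillate over the support of $W_v$. A minimal numerical sketch at a real place, under assumptions that are not in the text: $\chi_v(y) = |y|^{it}$, the stand-in conductor $1 + |t|$ for $\mathrm{Cond}_v(\chi)$, a specific bump function for $g_1$ supported in $(-\delta, \delta)$, and a midpoint-rule quadrature. The names `bump` and `localized_mellin` are hypothetical.

```python
import cmath
import math


def bump(x, delta):
    """A smooth nonnegative bump supported in (-delta, delta); a stand-in for g_1."""
    u = (x / delta) ** 2
    if u >= 1.0:
        return 0.0
    return math.exp(-1.0 / (1.0 - u))


def localized_mellin(t, delta=0.5, n=20000):
    """Midpoint-rule value of  int W(y) * y^{it} dy/y  and of the mass  int W(y) dy,

    where W(y) = C * bump(C * (y - 1)) and C = 1 + |t| is the assumed conductor.
    """
    C = 1.0 + abs(t)
    lo, hi = 1.0 - delta / C, 1.0 + delta / C
    step = (hi - lo) / n
    total, mass = 0j, 0.0
    for k in range(n):
        y = lo + (k + 0.5) * step
        w = C * bump(C * (y - 1.0), delta)
        total += w * cmath.exp(1j * t * math.log(y)) / y * step
        mass += w * step
    return total, mass
```

On the support one has $|t \log y| \lesssim \delta$, so the phase stays within a fixed cone and the integral is bounded below by a constant multiple of the mass, uniformly in $t$. Shrinking `delta` tightens the cone, which mirrors the requirement that the support of $g_1$ be small enough.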

Then there exists $\varphi \in \pi$ with $W_\varphi = \prod_v W_v$. By unfolding, it follows that for
$\Re(s) \gg 1$

(11.20)
$$\int_z \mu_z(\varphi)|z|^{s-1/2}\,d^\times z = c_F\prod_v\int_{F_v^\times} W_v(a(y)n([\mathfrak{f}]))|y|^{s-1/2}\chi_v(y)\,d^\times y.$$

Here $c_F$ is a constant depending only on $F$, arising from change of measure; it is
entirely unimportant as we will be only interested in bounds.$^{24}$
By Lem. 11.7, with $\Phi(s)$ as in the statement of the Lemma,

(11.21)
$$\Phi(s) = c_F\,\theta'\,N(\mathfrak{d})^{s-1/2}\prod_{\text{infinite } v}\int_{F_v^\times} W_v(a(y))|y|^{s-1/2}\chi_v(y)\,d^\times y,$$

where $|\theta'| = \prod_{\mathfrak{q}|\mathfrak{f}}(1 - N(\mathfrak{q})^{-1})^{-1}$. For this choice of $\varphi$, the second assertion of the
Lemma is clear, and, if we choose the support of $g_1$ to be small enough, the first
assertion also follows easily.$^{25}$

(11.19) follows from Lem. 8.4 together with Lem. 11.3 and the upper bound
for L-functions near 1 due to Iwaniec. See [15, Chapter 8] for this bound. □

The previous Lemma shows that $L(1/2, \pi \times \chi)$ may be "well-approximated" by an
appropriate period integral. Unfortunately, this period integral is against a measure
of infinite mass, since $\mathbb{A}_F^\times/F^\times$ is of infinite volume. It is, therefore, convenient to
know that the $\mu_z$-integral of (11.18) can be truncated to a compact range without
affecting the answer too much. This is, roughly speaking, the geometric equivalent
of the approximate functional equation in the classical theory, and is provided by
the next Lemma. It says, roughly speaking, that the integral of (11.18) can be
truncated to the range where $z$ is around $N(\mathfrak{f})^{-1}$.

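Lemma 11.9. Let notation be as in Lem. 11.8. Let $g_+, g_-$ be positive smooth
functions on $\mathbb{R}_{>0}$ such that $g_+ + g_- = 1$, $g_+(t) = 1$ for $t \geq 2$ and $g_-(t) = 1$ for all
$t \leq 1/2$. Then

$$I_+ := \int_z \mu_z(\varphi)g_+(z/T)\,d^\times z \ll_{g_+,\epsilon} (N(\mathfrak{f})T)^{-1/2}(T\,\mathrm{Cond}(\pi)\mathrm{Cond}(\chi))^{\epsilon},$$

$$I_- := \int_z \mu_z(\varphi)g_-(z/T)\,d^\times z \ll_{g_-,\epsilon} (N(\mathfrak{f})T)^{1/2}(T\,\mathrm{Cond}(\chi))^{\epsilon}(\mathrm{Cond}_\infty(\chi)\mathrm{Cond}(\pi))^{1+\epsilon}.$$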

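Proof. Recall the definition of $\mu_z$ from (6.4). Put $\hat{g}_\pm(s) = \int_0^\infty g_\pm(x)x^{s-1}\,dx$, the
Mellin transform of $g_\pm$; then $\hat{g}_\pm$ is holomorphic in $\pm\Re(s) < 0$ and for any $M > 0$,
$\pm\sigma < 0$ the integral $\int_{\Re(s)=\sigma}|\hat{g}_\pm(s)|(1 + |s|)^{M}\,ds$ is convergent. Then, for any
$\pm\sigma > 0$, we have, by the Plancherel formula on $\mathbb{R}^\times$, that:

$$\int_z \mu_z(\varphi)g_\pm(z/T)\,d^\times z = \frac{1}{2\pi i}\int_{\Re(s) = -\sigma}\left(\int_z\mu_z(\varphi)|z|^{-s}\,d^\times z\right)T^{s}\,\hat{g}_\pm(s)\,ds.$$

So for any $M > 0$,

$$|I_\pm| \ll T^{-\sigma}N(\mathfrak{f})^{-1/2}\sup_{\Re(s) = 1/2+\sigma}\frac{|L^{unr}(s)\Phi(s)|}{(1 + |s|)^{M}},$$

where $\Phi$ is as in the previous Lemma. Take $\sigma = 1/2 + \epsilon$ in the $+$ case, $-1/2 - \epsilon$ in
the $-$ case.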



24. The measure $\mu_z$ is normalized as a probability measure, whereas to unfold from $\mathbb{A}_F^\times$ to
$\prod_v F_v^\times$ we use the measures previously set up there (see Section 2.6).

25. For the assertion concerning the lower bound for $|\Phi(1/2)|$, the point, in words, is that our
choices are so that $\chi_v$ does not oscillate over the support of $W_v$, cf. Remark 2.2. Note how
convenient it is, here and elsewhere, to use smooth vectors rather than $K_\infty$-finite vectors; one
could not e.g. achieve $W_v$ of compact support with $K_\infty$-finite vectors.






Using Iwaniec's bounds on L-functions near 1 [15, Chapter 8] and the functional
equation, we see that for sufficiently large $M$:

(11.22)
$$\sup_{\Re(s) = 1+\epsilon}|L^{unr}(s, \pi\times\chi)| \ll \mathrm{Cond}(\pi\otimes\chi)^{\epsilon},$$

sup SRW= _ e |-L„ nr (s,Trxx)| „ ,. , 1/2+s -pr ._ . 

tz —j hM <m Cond(7r®x) 7 sup Z„(s, tt v xxv 

For each $v$ where $\chi_v$ is ramified and $L_v(s, \pi_v \times \chi_v)$ is not identically 1, the
representation $\pi_v$ must also be ramified (i.e., not spherical). So one can bound the
product on the second line of (11.22), using trivial bounds towards the Ramanujan
conjecture, by $\mathrm{Cond}(\pi)^{1/2+2\epsilon}$. The fact that $\mathrm{Cond}(\chi) = \mathrm{Cond}_\infty(\chi)N(\mathfrak{f})$, the bound
[6] $\mathrm{Cond}(\pi \otimes \chi) \leq \mathrm{Cond}(\pi)\mathrm{Cond}(\chi)^2$, and the (easily verified) analogue of this
bound of [6] at archimedean places, allows one to conclude. □
We now address the analogue of the previous Lemmas when $\pi$ is noncuspidal.

Lemma 11.10. Let $s_0, s' \in \mathbb{C}$. There is an absolute $C > 0$ (i.e., depending only
on $F$) and a Schwartz function $\Psi$ (depending on $\chi$) so that if we put

f ci x /PX E^(s,a(y)n(\f}))x(y)\y\ s ' d x y 

[ ' J ' W H^s + s'MxA-s + s') 

where $E$ is defined as in (10.9), then the integral defining $\Phi$ is absolutely convergent
in a right half-plane $\Re(s) \gg 1$. Moreover, $\Phi$ extends from $\Re(s), \Re(s') \gg 1$ to a
holomorphic function on $\mathbb{C}^2$, satisfying

(1) $|\Phi(1/2, 0)| \gg 1$ and $|\Phi(s, s')| \leq C^{1+|\Re(s)|+|\Re(s')|}(1 + |s| + |s'|)^{C}$.

Moreover, given $N > 0$ we have that

(11.23) $|\Phi(s, s')|(1 + |s| + |s'|)^{N} \ll_{\Re(s),\Re(s'),N} \mathrm{Cond}_\infty(\chi)^{N'}$,

where $N'$ and the implicit constant may be taken to depend continuously on
$N, \Re(s), \Re(s')$.

(2) $\Psi$, and so also $E_\Psi(s, g)$, is invariant by $K_{\max}$;

(3) Let h € H(k) be as in ^lU.18j) . and put Eh := j^/ s \-^i h(s)E-q,(s, g)dg. For 
each d, (3 there is N such that S OO: d l 0(Ef l ) <§C K |j/i||oCond 00 (x)'' V - 

Proof. We shall not explicitly address details of convergence. The manipulations
that follow may be justified by similar reasoning to that of Lem. 10.6.

We now define a Schwartz function $\Psi_v$ on $F_v^2$ for each place $v$. For each finite
place $v$, let $\Psi_v$ be the characteristic function of $\mathfrak{o}_{F_v}^2$.

For infinite $v$, we will first define a Schwartz function $\rho_v$ on $F_v$, and then take
$\Psi_v(x, y) = \rho_v(x)\check{\rho}_v(y)$; here $\check{\rho}_v$ is the inverse Fourier transform of $\rho_v$, satisfying

$$\int_{F_v}\check{\rho}_v(y)\,e_{F_v}(xy)\,dy = \rho_v(x).$$



2 ^The content of the following Lemma, in classical language, is related to the following obser- 
vation. Let x De an even Dirichlet character mod q, and let E* (s, z) to be the Eisenstein series 
of GHS), and E*(s, z) := E*(s, z) - ^2s)y' - {(2(1 - s))?, 1 " 3 , then I / °° Ei< x < q -i E*(s, f + 

iy)y s d x y coincides, up to some harmless factor, with q~ 1 ^ 2 A(x< s + s ')^(Xi 1 ~~ s + where 
$\Lambda(\chi, s)$ is the usual Dirichlet L-function completed to include the $\Gamma$-factor at $\infty$. This particular
expression is actually not quite suitable for our needs, because the rapid decay of the $\Gamma$-factor
swamps information about the finite L-function, and in fact the Lemma uses (the equivalent of)
a different test vector belonging to the automorphic representation underlying $E^*(z, s)$.
Let $g_1$ be a smooth positive function of compact support on $F_v$. Let $\deg(v) = 2$
if $v$ is complex and $\deg(v) = 1$ if $v$ is real. For $v | \infty$, define

$$\rho_v(y) = \mathrm{Cond}_v(\chi)\,g_1\big(\mathrm{Cond}_v(\chi)^{1/\deg(v)}(y - 1)\big).$$

In words: in the real (resp. complex) case, $\rho_v$ is localized in a real (resp. complex)
interval (resp. disc) around 1, of length (resp. area) $\mathrm{Cond}_v(\chi)^{-1}$. Now put
$\Psi_v(x, y) = \rho_v(x)\check{\rho}_v(y)$. The function $\Psi_v$ is not compactly supported; however, it is
of rapid decay. Indeed for each Schwartz norm $S$, there is $M > 0$ such that

(11.24) $S(\Psi_v) \ll \mathrm{Cond}_v(\chi)^{M}$.

Define a Schwartz function on $\mathbb{A}_F^2$ via $\Psi(x, y) = \prod_v \Psi_v(x, y)$. Define $W_\Psi(s, g)$ as
in Lem. 10.5 to be the Fourier coefficient of $E_\Psi(s, g)$. The choice of $\Psi$ and Lem.
10.5 shows that $W_\Psi(s, g) = \prod_v W_v(g)$, where, for each finite $v$, $W_v$ is given by Cor.
10.1 and satisfies

(11.25) / W v {a{y))\y\ S 'd x y = q d v ^ 1+s '- s) L v {\ ■ \ s ,s')L v (\ • ^V), 



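For infinite $v$, $W_v$ satisfies (Lem. 10.5)
(11.26)

$$\int_{F_v^\times} W_v(a(y))\,\omega(y)|y|^{s'}\,d^\times y = \int_{F_v^\times}\rho_v(x)\,\omega(x)|x|^{s+s'}\,d^\times x\ \int_{F_v^\times}\rho_v(x)\,\omega(x)|x|^{1-s+s'}\,d^\times x.$$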



By Fourier analysis and Lem. 110.31 Ey(s,g) — J2 a eF x W^{a{a)g). Thus, for 
»(s)»l, 

(11.27) 

E*(s,a(y)n([f]))x(y)\y\ s 'd*y = [ W*{s, a(y)n([f}))x(y)\y\ s ' d x y 



= n/ W v (a(y)n([f])) X (y)\y\ s 'd*y 
v JyeF* 

For $\Re(s) \gg 1$, we use (10.17), (11.26) and Lemma 11.7 to evaluate the local factors,
obtaining:

(11.28)
$$\Phi(s, s') = \theta'\cdot N(\mathfrak{d})^{1+s'-s}\prod_{v \text{ infinite}}\int_{F_v^\times} W_v(a(y))\chi(y)|y|^{s'}\,d^\times y,$$

where $|\theta'| = \prod_{\mathfrak{q}|\mathfrak{f}}(1 - N(\mathfrak{q})^{-1})^{-1}$. Now, by choice of $\Psi_v$, the integral $I_v(s) :=
\int_{y\in F_v^\times}\rho_v(y)\chi(y)|y|^{s}\,d^\times y$ satisfies $|I_v(1/2)| \gg 1$ and $|I_v(s)| \ll (1 + |s|)^{C}\,C^{1+|\Re(s)|}$,
at least when we choose the support of $g_1$ to be sufficiently small. It also satisfies
$|I_v(s)|(1 + |s|)^{N} \ll_{N,\Re(s)} \mathrm{Cond}_\infty(\chi)^{N'}$, where $N'$ and the implicit constant may be
taken to depend continuously on $N, \Re(s)$.

The corresponding facts (i.e. the first assertion of the Lemma) about $\Phi$ follow
immediately. The second assertion of the Lemma is immediate from our choice of
$\Psi$.

As for the third and final assertion, it follows from Rem. 10.3 and (11.24).

□ 
Lemma 11.11. Let notations be as in the previous Lemma. Assume $\chi$ is ramified
at at least one finite place. Let $g_+, g_-$ be positive smooth functions on $\mathbb{R}_{>0}$ such
that $g_+ + g_- = 1$, $g_+(t) = 1$ for $t \geq 2$ and $g_-(t) = 1$ for all $t \leq 1/2$.
Then

(11.29) $\mu_z(E_h) \ll \min(z^{K}, z^{-K})$

for any $K \geq 1$.

Moreover, there is an absolute $N > 0$ such that

$$I_+ := \int_z \mu_z(E_h)g_+(z/T)\,d^\times z \ll_\epsilon (N(\mathfrak{f})T)^{-1/2}(T\,\mathrm{Cond}(\chi))^{\epsilon}\,\|h\|_N,$$

$$I_- := \int_z \mu_z(E_h)g_-(z/T)\,d^\times z \ll_\epsilon (N(\mathfrak{f})T)^{1/2}(T\,\mathrm{Cond}(\chi))^{\epsilon}\,\mathrm{Cond}_\infty(\chi)^{1+\epsilon}\,\|h\|_N.$$

(Here the norms $\|\cdot\|_N$ are as in (10.18).)

Proof. Again, we shall leave verification of convergence to the reader. Recall that, 
with the relevant measure on A F /F X having mass 1: 

(11.30) Vz(E h )= f E h (a(y)n[f}) X (y)d x y 

JyeA*/Fx,\y\=z 

f h( S )E*( S ,a(y)n(mx(y)d x y 

y£A*/F*,\y\=z 



h(s)E^(s,a(y)n([f])) X (y)d x y 

yeA*/Fx,\y\=z ^5R(s)>l 

Here, the last equality is justified by the fact that — E<s,)(s, a(y)n([f])) is invari- 
ant under y t— > yy', for y' e fl v o*. On the other hand, \ is nontrivial on J\ v 0$ > 
by assumption. 

Combining (11.30) with Lem. 11.10 we have
(11.31)

$$\int_z \mu_z(E_h)|z|^{s'}\,d^\times z = c_F\,N(\mathfrak{f})^{-1/2}\int_{\Re(s)\gg 1} h(s)L(\chi, 1-s+s')L(\chi, s+s')\Phi(s, s')\,ds.$$

Here $c_F$ is an (unimportant) constant arising from measure normalization, as in
(11.20).

The assertion (11.29) follows immediately from this, inverse Mellin transform,
and analytic properties of the right-hand side.

Now proceed as in Lem. 11.9; it follows that (for any $M$)

$$|I_\pm| \ll T^{\mp(1/2+\epsilon)}N(\mathfrak{f})^{-1/2}\sup_{\Re(s') = \pm(1/2+\epsilon)}(1 + |s'|)^{-M}\left|\int h(s)L(\chi, 1-s+s')L(\chi, s+s')\Phi(s, s')\,ds\right|.$$

We deal with the case of $I_-$. In that case, we take the inner integral to be over
$\Re(s) = 1/2$, and put $s' = -1/2 - \epsilon - it'$, and it will suffice to bound $\int h(1/2 +
it)L(\chi, -\epsilon - it - it')L(\chi, -\epsilon + it - it')(1 + |t| + |t'|)^{C}$. This is bounded, up to an
implicit constant depending on $\epsilon$, by $\mathrm{Cond}(\chi)^{1+2\epsilon}\|h\|_{M'}(1 + |t'|)^{C}$ for sufficiently
big $M', C$, whence the result. □




References 

[1] Joseph Bernstein and Andre Reznikov. Sobolev norms of automorphic functionals and Fourier coefficients of cusp forms. C. R. Acad. Sci. Paris Ser. I Math., 327(2):111-116, 1998.

[2] Joseph Bernstein and Andre Reznikov. Sobolev norms of automorphic functionals. International Math Research Notices, 40 (2155-2174), 2002.

[3] Joseph Bernstein and Andre Reznikov. Periods, subconvexity of L-functions and representation theory. arxiv: math.RT/0504411

[4] Jean Bourgain. Pointwise ergodic theorems for arithmetic sets. Inst. Hautes Etudes Sci. Publ. Math., (69):5-45, 1989. With an appendix by the author, Harry Furstenberg, Yitzhak Katznelson and Donald S. Ornstein.

[5] Jean Bourgain and Elon Lindenstrauss. Entropy of quantum limits. Comm. Math. Phys., 233(1):153-171, 2003.

[6] C. Bushnell and G. Henniart. An upper bound on conductors for pairs. J. Number Theory 65: 183-196, 1997.

[7] L. Clozel and E. Ullmo. Equidistribution de mesures algebriques. Preprint.

[8] M. Cowling, U. Haagerup, and R. Howe. Almost $L^2$ matrix coefficients. J. Reine Angew. Math., 387:97-110, 1988.

[9] P. Cohen. Hyperbolic distribution problems on Siegel 3-folds and Hilbert modular varieties. Duke Math. Journal, to appear.

[10] W. Duke, J. Friedlander and H. Iwaniec. Class group L-functions. Duke Math. Journal 79: 1-56, 1995.

[11] L. Flaminio and G. Forni. Invariant distributions and time averages for horocycle flows. Duke Math. Journal, 119: 465-526, 2003.

[12] J. Friedlander and H. Iwaniec. A mean-value theorem for character sums. Michigan Math. J., 39(1):153-159, 1992.

[13] Anton Good. Cusp forms and eigenfunctions of the Laplacian. Math. Ann, 255: 523-548, 1981.

[14] D. Hejhal. On the value distribution properties of automorphic functions along closed horocycles. XVI Rolf Nevanlinna Colloquium, I. Laine and O. Martio (ed), deGruyter 1996, 39-52.

[15] H. Iwaniec. Introduction to the spectral theory of automorphic forms. Revista Matematica Iberoamericana, Madrid, 1995.

[16] H. Jacquet and R. P. Langlands. Automorphic forms on GL(2). Springer-Verlag, Berlin, 1970. Lecture Notes in Mathematics, Vol. 114.

[17] Herve Jacquet. Automorphic forms on GL(2). Part II. Springer-Verlag, Berlin, 1972. Lecture Notes in Mathematics, Vol. 278.

[18] D. Y. Kleinbock and G. A. Margulis. Logarithm laws for flows on homogeneous spaces. Invent. Math., 138(3):451-494, 1999.

[19] Serge Lang. Algebraic number fields.

[20] Yu. Linnik. Ergodic properties of algebraic fields.

[21] Gregory Margulis. On some aspects of the theory of Anosov systems. Springer monographs in mathematics, 2004.

[22] Philippe Michel. The subconvexity problem for Rankin-Selberg L functions and equidistribution of Heegner points. Annals of Math. 160:185-236, 2004.

[23] Philippe Michel and Akshay Venkatesh. Periods, equidistribution and subconvexity. In preparation.

[24] Philippe Michel and Akshay Venkatesh. Equidistribution, L-functions and ergodic theory: on some problems of Yu. V. Linnik. Proceedings of the Madrid ICM, to appear.

[25] Marina Ratner. Rigidity of time changes for horocycle flows. Acta. Math. 156, 1-32, 1986.

[26] Marina Ratner. Raghunathan's conjectures for SL(2,R). Israel. J. Math 80, 1-31, 1992.

[27] Marina Ratner. The rate of mixing for the geodesic and horocycle flows. Ergodic Theory and Dynamical Systems 7, 267-288, 1987.

[28] Andre Reznikov. Rankin-Selberg without unfolding. Preprint available at www.math.biu.ac.il/~reznikov/publications.html.

[29] Peter Sarnak. Fourth moments of Grossencharakteren zeta functions. Comm. Pure Appl. Math., 38(2):167-178, 1985.

[30] Peter Sarnak. Integrals of products of eigenfunctions. Internat. Math. Res. Notices, (6):251 ff., approx. 10 pp. (electronic), 1994.

[31] Peter Sarnak. Asymptotic behavior of periodic orbits of the horocycle flow and Eisenstein series. Comm. Pure Appl. Math, 34: 719-739, 1981.

[32] Nimish A. Shah. Limit distributions of polynomial trajectories on homogeneous spaces. Duke Math. J., 75(3): 711-732, 1994.

[33] Goro Shimura. Introduction to the arithmetic theory of automorphic forms. 1971.

[34] Kannan Soundararajan. Nonvanishing of quadratic Dirichlet L-functions at s = 1/2. Ann. of Math. 152 (2), 447-488, 2000.

[35] Andreas Strombergsson. On the uniform equidistribution of long closed horocycles. Duke. Math. Journal 123: 507-547, 2004.

[36] Terence Tao. The ergodic and combinatorial approaches to Szemeredi's theorem. math.CO/0604456.

[37] Jean-Loup Waldspurger. Quelques proprietes arithmetiques de certaines formes automorphes sur GL(2). Compositio. Math., 54: 121-171, 1985. Sur les valeurs de certaines fonctions L automorphes en leur centre de symetrie. Compositio. Math. 54: 173-242, 1985.

[38] Vinayak Vatsal. Uniform distribution of Heegner points. Invent. Math. 148 (1): 1-46, 2002.

[39] S. Zhang. Equidistribution of CM points on quaternion Shimura varieties. Preprint.



